The prodigalities from scientific-coder

prodigalities's Introduction

This repository hosts code for my "Serendipitous Prodigalities" blog.

#1. A little theory and a small case study on optimizing code for run-time efficiency.#

We will analyse code developpement for a language identification program using ngrams. The focus will solely be on run-time efficiency, not language classification accuracy.

##1.1. Using domain-specific knowledge UTF-8, UTF-32 and custom storage for n-grams representations.##

###1.1.1. First implementations###

cf. ngrams_counter_[utf8|utf32|bitfields].hxx

###1.1.2. Profiler-driven tuning###

Using pref, ad cache/callgrind cf. BASIC_UTF8_CMP, CUSTOM_ARRAY_ROTATE

##1.2. Using domain-specific knowledge : standard unordered_map and custom perfect hashing for n-gram matching.## TODO, using gperf, llvm switch statement

##1.3. Parallelization granularity : processes/threads, intra/inter files## TODO, using GNU parallel and OpenMP (+ CPU affinity) or std::threads

Recommend Projects

scientific-coder / prodigalities Goto Github PK

prodigalities's Introduction

prodigalities's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs