Texonom / Engineering / Software Engineering / Programming / Programming Language / Programming Languages / 4GL / Python / Python Grammar / Python Object / Python Type / python string / StringZilla

StringZilla

Creator: Seonglae Cho
Created: 2023 Jul 13 8:58
Editor: Seonglae Cho
Edited: 2023 Aug 30 16:0

Refs
Python pathlib readlines
StringZilla
ashvardanian • Updated 2023 Aug 30 15:59

Stringzilla - Stupid Heuristics to Search & Sort Strings 5-10x Faster
A few years ago, I used a trivial heuristic in combination with SIMD intrinsics to showcase the untapped potential of modern CPUs. I benchmarked the qsort of LibC and the std::search of the C++ Standard Templates Library, resulting in ~1.5 GB/s throughput for substring search on a single core. Not bad, but the memory bandwidth would be closer to 10-15 GB/s per core. I’ve assumed that if the first 4 characters of the string match, the rest is also likely to match.
https://ashvardanian.com/posts/stringzilla/
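
The heuristic quoted in the excerpt above (compare the first 4 characters, and only then check the rest) can be sketched in plain Python. This is an illustration of the idea only, not StringZilla's implementation, which relies on SIMD intrinsics; the function name `find_with_prefix_heuristic` is hypothetical, and the sketch assumes byte strings.

```python
def find_with_prefix_heuristic(haystack: bytes, needle: bytes) -> int:
    """Return the index of the first occurrence of needle in haystack, or -1.

    Hypothetical helper: compares a 4-byte prefix as a single integer first,
    and only verifies the remaining bytes when that cheap check succeeds.
    """
    n, m = len(haystack), len(needle)
    if m == 0:
        return 0
    if m > n:
        return -1
    if m < 4:
        # Too short for a 4-byte prefix; fall back to the built-in search.
        return haystack.find(needle)

    # Pack the first 4 bytes of the needle into one integer.
    prefix = int.from_bytes(needle[:4], "little")
    tail = needle[4:]

    for i in range(n - m + 1):
        # Cheap check: one integer comparison covering 4 bytes.
        window = int.from_bytes(haystack[i:i + 4], "little")
        if window == prefix and haystack[i + 4:i + m] == tail:
            return i
    return -1


if __name__ == "__main__":
    text = b"search and sort strings faster with simple heuristics"
    print(find_with_prefix_heuristic(text, b"strings"))
    print(find_with_prefix_heuristic(text, b"missing"))
```

In pure Python this will be slower than the built-in `bytes.find`; the point is only to show the shape of the check. The speedup described in the post comes from doing the 4-byte comparison with SIMD instructions across many positions at once.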
Copyright Seonglae Cho