Building a full-text search engine in 150 lines of Python code · Bart de Goede
This is a pretty interesting post. It uses several small algorithms and provides simple code snippets for each of them:
- tokenizing
- stemming
- boolean search
- term frequency
- inverse document frequency
All in just 150 lines of code.
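
To give a feel for how these pieces fit together, here is a minimal sketch. It is not the code from the post: the names `TinyIndex`, `analyze` and `naive_stem` are made up for illustration, the stemmer is a crude suffix stripper standing in for a real one, and the scoring (raw term count times a base-10 log IDF) is just one common choice.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def naive_stem(token):
    """Crude suffix stripping (assumption: a stand-in for a real stemmer)."""
    for suffix in ("ing", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def analyze(text):
    """Tokenize, then stem every token."""
    return [naive_stem(token) for token in tokenize(text)]


class TinyIndex:
    """Hypothetical in-memory inverted index with TF-IDF ranking."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> ids of documents containing it
        self.term_counts = {}             # doc id -> Counter of term frequencies

    def add(self, doc_id, text):
        terms = analyze(text)
        self.term_counts[doc_id] = Counter(terms)
        for term in terms:
            self.postings[term].add(doc_id)

    def idf(self, term):
        """Inverse document frequency: rarer terms get a higher weight."""
        doc_freq = len(self.postings.get(term, ()))
        if doc_freq == 0:
            return 0.0
        return math.log10(len(self.term_counts) / doc_freq)

    def search(self, query):
        """Boolean AND over the query terms, ranked by summed TF-IDF."""
        terms = analyze(query)
        if not terms:
            return []
        matches = set.intersection(*(self.postings.get(t, set()) for t in terms))
        scored = [
            (doc_id, sum(self.term_counts[doc_id][t] * self.idf(t) for t in terms))
            for doc_id in matches
        ]
        return sorted(scored, key=lambda pair: pair[1], reverse=True)


index = TinyIndex()
index.add(1, "Searching the search engines")
index.add(2, "Python search engines")
index.add(3, "Cooking with Python")
results = index.search("search")  # document 1 mentions the term twice, so it ranks first
```

A query only matches documents that contain every query term, so `index.search("cooking engines")` returns an empty list here. Switching to OR semantics would mean taking the union of the posting sets instead of the intersection.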