Building a full-text search engine in 150 lines of Python code · Bart de Goede

This is pretty interesting post. It uses lots of small algorithms and provide simple code-snippets for them:

  • tokenzing
  • stemming
  • boolean search
  • term frequency
  • inverse document frequency

All in just 150 lines of code.

Pratyush Mittal @pratyush