Files in the top-level directory from the latest check-in of branch trunk
- go
- prototype
- readme.md
Search Engine
Is it possible to run a simple search engine without a billion dollars?
Inspiration & philosophy
Prototyping a personal search engine > Observations that make helpful for a successful prototype I don't need to crawl the whole web just the pages I'm interested. I don't need to create or monitor a recursive web crawler I avoid junk because I'm curating the sources through my existing web reading practices I am targeting a small search corpus, approximately 100,000 pages or so. I am only indexing HTML and plain text