I’ve been looking for a good solution to manage full text search for large data chunks (2GB - 100GB). I’ve written a couple of solutions using Xapian with limited success, but unfortunately I haven’t been satisfied with it overall. Performance was good, but there were a number of issues with flexibility that have me ultimately looking for another solution.
At my usual day job, the topic was brought up and I mentioned Xapian and Lucene as solutions, however we’re looking to stay away from Java as it’s not currently in our architecture, and as I stated before: Xapian doesn’t quite have the capabilities I’m looking for to handle even my own systems. Someone brought up Sphinx as something that was being looked into, and I jumped into the typical research process.
One of the key elements that Sphinx seems to offer …
[Read more]