dimanche 18 janvier 2015

Good practices for real time search with Elasticsearch

Elasticsearch good practices: introduction

As promised in my first articlethis is the start of my series of posts about the good practices when using Elasticsearch. These tips will come from the Elasticsearch documentation, other blogs and my own experience.

What is Elasticsearch?

Elasticsearch ( ES) is a real-time scalable open-source search engine built upon Lucene.  The near-real time property means that the indexed documents are searchable seconds only after having been uploaded. Scalable, Elasticsearch splits the information into shards and replicas that are automatically shared between the nodes of the cluster. ES is becoming more and more popular since it is easy to use, intuitive and well-documented. You can interact with Elasticsearch with any language able to make HTTP requests since it is REST-based.
After having discovered Elasticsearch, I was especially amazed by the speed of the searches: using like Lucene inverted indexes, finding the relevant documents in the Big Data haystack is child play.
Now that you've had a small introduction to ES, stay tuned for the next parts of the series!