Projects

Daily Deals and eCommerce Analysis

At Bonobo Analytics, I built this entire product from the ground up. It fetched daily updates of products sold on daily-deal sites such as Groupon and LivingSocial, and source products…

Voice of the Market: Sentiment Analysis Engine

Voice of the Market (VOOM) is a sentiment/opinion analysis product that analyses how positive, neutral, or negative opinion of a consumer product is. It uses scraped data fetched from numerous websites like review sites,…

Extensible, Distributed, Scalable Crawler

Developed an extensible, distributed, scalable, bandwidth-efficient crawler capable of scraping and extracting various entities from review sites, blogs, forums, social networks, etc. This was before the Scrapy project was started, when Nutch was still in its infancy.

Tech: Python 2.6, BeautifulSoup, Python multiprocessing, MemcacheDB, PostgreSQL & Greenplum.

MontyLingua3

MontyLingua is a popular natural language processing tool that can understand English and correct it based on common sense; it is used by many people around the world for research. It did have one small problem: licensing.