It is not seldom that I encounter the following question while marketing a solution: “What are the hypothesis you are testing for and how will you prove or disprove it?”, thereby alluding to standard hypothesis testing techniques. In this context, I had a short conversation with a friend a couple of days back on the prevalent culture in many analytics organisations and academic
Data Science in Retail…. Problems to solve.
In this post, I’ll cover, as of last year, some NLP tools available and the features they offer – in terms of the basic components that have been described thus far. Additionally, the tables also talk about Named Entity Recognizers and Correference Resolution; these will be covered in the next post. Table 1: Open source tools for basic NLP
This is Part 3 of the NLP Pipelines Series. View Part 1 and Part 2. In this post, I cover a few more basic components required for building an NLP System. Parsing Chunking. Chunking is often regarded as the preprocessing step for parsing. It involves the recognition of non-overlapping phrases belonging to different syntactic categories. Originally, chunking referred
Abstract Social media has proliferated the online advertising space with its revenue set to grow from 5.1B USD in 2013 to 15B USD in 2018, with a YoY growth rate of about 24%  that assists advertisers to listen to, analyze, and interact with customers on social media. With this backdrop, this case-study introduces several major considerations in addressing
This is Part 2 of the NLP Pipelines Series. View Part 1 In the next few posts, I will cover a few basic components of an NLP system from the perspectives of what’s available out there in the form of open-source tools and data (for training such components), how the performance of the components are “measured”, and what the state-of-art
Abstract The availability of large-scale textual data through disparate sources like the digitized libraries, social media, and the internet has made automated techniques to process text imperative. “Natural language processing” (NLP) refers to the functioning of software and hardware components in a computer system which analyze or synthesize spoken or written language. Several reports in the