Based on some recent conversations, I realized that text preprocessing is a severely overlooked topic. A few people I spoke to mentioned inconsistent results from their NLP applications only to realize that they were not preprocessing their text or were using the wrong kind of text preprocessing for their project. With that in mind, I thought of shedding some light around … [Read more...] about Getting Started with Text Preprocessing for Machine Learning & NLP
Natural Language Processing
Building NLP Classifiers Cheaply With Transfer Learning and Weak Supervision
Introduction There is a catch to training state-of-the-art NLP models: their reliance on massive hand-labeled training sets. That’s why data labeling is usually the bottleneck in developing NLP applications and keeping them up-to-date. For example, imagine how much it would cost to pay medical specialists to label thousands of electronic health records. In general, having … [Read more...] about Building NLP Classifiers Cheaply With Transfer Learning and Weak Supervision
OpenAI’s GPT-2: Results, Hype, and Controversies
The nonprofit AI research company, OpenAI, recently released a new language model, called GPT-2, which is capable of generating realistic texts in a wide range of styles. In fact, the company stated that the model is so good at automatic text generation that it can be used for nefarious purposes; therefore, it did not publicize the trained model. The dangerous-to-release … [Read more...] about OpenAI’s GPT-2: Results, Hype, and Controversies
Why Bots Are So Difficult to Get Right
Most of today’s biggest companies have begun experimenting with chatbots, and many others are poised to follow suit. A recent survey found 56% of service leaders were actively looking for ways to integrate artificial intelligence, including chatbots. This surge in interest reflects the significant benefits this technology can offer: bots can save human agents time by providing … [Read more...] about Why Bots Are So Difficult to Get Right
New Approaches For Leveraging NLP in Marketing & Advertising
Natural Language Processing (NLP) is one of the longest-standing areas of AI research. The idea of being able to speak to a computer and be understood, whether verbally or in writing, has been around for as long as the idea of artificial intelligence. These days, NLP has gone far beyond being merely a better input method - we’re able to use machine learning algorithms to … [Read more...] about New Approaches For Leveraging NLP in Marketing & Advertising
Sentiment Analysis: Types, Tools, and Use Cases
What do you do before purchasing something that costs more than a pack of gum? Whether you want to treat yourself to new sneakers, a laptop, or an overseas tour, processing an order without checking out similar products or offers and reading reviews doesn’t make much sense any more. Thanks to comment sections on eCommerce sites, social nets, review platforms, or dedicated … [Read more...] about Sentiment Analysis: Types, Tools, and Use Cases
Generalized Language Models: Common Tasks & Datasets
EDITOR'S NOTE: Generalized Language Models is an extensive four-part series by Lillian Weng of OpenAI. Part 1: CoVe, ELMo & Cross-View TrainingPart 2: ULMFiT & OpenAI GPTPart 3: BERT & OpenAI GPT-2Part 4: Common Tasks & Datasets Do you find this in-depth technical education about language models and NLP applications to be useful? Subscribe below to … [Read more...] about Generalized Language Models: Common Tasks & Datasets
Generalized Language Models: BERT & OpenAI GPT-2
EDITOR'S NOTE: Generalized Language Models is an extensive four-part series by Lillian Weng of OpenAI. Part 1: CoVe, ELMo & Cross-View TrainingPart 2: ULMFiT & OpenAI GPTPart 3: BERT & OpenAI GPT-2Part 4: Common Tasks & Datasets Do you find this in-depth technical education about language models and NLP applications to be useful? Subscribe below to … [Read more...] about Generalized Language Models: BERT & OpenAI GPT-2
Generalized Language Models: ULMFiT & OpenAI GPT
EDITOR'S NOTE: Generalized Language Models is an extensive four-part series by Lillian Weng of OpenAI. Part 1: CoVe, ELMo & Cross-View TrainingPart 2: ULMFiT & OpenAI GPTPart 3: BERT & OpenAI GPT-2Part 4: Common Tasks & Datasets Do you find this in-depth technical education about language models and NLP applications to be useful? Subscribe below to … [Read more...] about Generalized Language Models: ULMFiT & OpenAI GPT
Generalized Language Models: CoVe, ELMo & Cross-View Training
EDITOR'S NOTE: Generalized Language Models is an extensive four-part series by Lillian Weng of OpenAI. Part 1: CoVe, ELMo & Cross-View TrainingPart 2: ULMFiT & OpenAI GPTPart 3: BERT & OpenAI GPT-2Part 4: Common Tasks & Datasets Do you find this in-depth technical education about language models and NLP applications to be useful? Subscribe below to … [Read more...] about Generalized Language Models: CoVe, ELMo & Cross-View Training