Figure 1: Representation of the Text2SQL flow As our world is getting more global and dynamic, businesses are more and more dependent on data for making informed, objective and timely decisions. However, as of now, unleashing the full potential of organisational data is often a privilege of a handful of data scientists and analysts. Most employees don’t master the … [Read more...] about Creating An Information Edge With Conversational Access To Data
Language Models
Can ChatGPT Compete With Domain-Specific Sentiment Analysis Machine Learning Models?
ChatGPT is a GPT (Generative Pre-trained Transformer) machine learning (ML) tool that has surprised the world. Its breathtaking capabilities impress casual users, professionals, researchers, and even its own creators. Moreover, its capacity to be an ML model trained for general tasks and perform very well in domain-specific situations is impressive. I am a researcher, and its … [Read more...] about Can ChatGPT Compete With Domain-Specific Sentiment Analysis Machine Learning Models?
All Languages Are NOT Created (Tokenized) Equal
This article was originally published on the author’s blog and re-published to TOPBOTS with permission from the author. Large language models such as ChatGPT process and generate text sequences by first splitting the text into smaller units called tokens. In the image below, each colored block represents a unique token. Short or common words such as “you”, “say”, “loud”, and … [Read more...] about All Languages Are NOT Created (Tokenized) Equal
Four LLM Trends Since ChatGPT And Their Implications For AI Builders
In October 2022, I published an article on LLM selection for specific NLP use cases , such as conversation, translation and summarisation. Since then, AI has made a huge step forward, and in this article, we will review some of the trends of the past months as well as their implications for AI builders. Specifically, we will cover the topics of task selection for … [Read more...] about Four LLM Trends Since ChatGPT And Their Implications For AI Builders
Beyond ChatGPT: Open-Source Projects in Generative AI
In the realm of generative AI, open-source projects have emerged as powerful tools that democratize access to cutting-edge models and foster collaborative development. They enable researchers, developers, and enthusiasts to experiment, improve upon existing models, and create novel applications that benefit society as a whole. In this article, I want to briefly review the … [Read more...] about Beyond ChatGPT: Open-Source Projects in Generative AI
Exploring Creativity in Large Language Models: From GPT-2 to GPT-4
Creativity as envisioned by Midjourney. Generated by the author This article was originally published on the author’s blog and re-published to TOPBOTS with permission from the author. In recent weeks, people have used large language models (LLMs) to generate a variety of creative content, such as books, flash fiction, rap battles, and music chords. But is it possible to … [Read more...] about Exploring Creativity in Large Language Models: From GPT-2 to GPT-4
Does AI Have Political Opinions?
This article was originally published on the author's blog and re-published to TOPBOTS with permission from the author. There’s a quote about how in polite society, you should never talk about three things: politics, religion, and money. In this article, I break polite conventions to determine how an AI would respond to all three of those topics. As AI tools become more and … [Read more...] about Does AI Have Political Opinions?
Overcoming The Limitations Of Large Language Models
How popular LLMs score along human cognitive skills (source: semantic embedding analysis of ca. 400k AI-related online texts since 2021) Disclaimer: This article was written without the support of ChatGPT. In the last couple of years, Large Language Models (LLMs) such as ChatGPT, T5 and LaMDA have developed amazing skills to produce human language. We are quick to … [Read more...] about Overcoming The Limitations Of Large Language Models
Can Too Much BERT Be Bad for You?
BERT and GPT-2: we all love language models… I mean, who doesn’t? Language models like BERT and GPT-2 (and GPT-3) have had an enormous impact on the entire NLP field. Most of the models that obtained groundbreaking results on the famous GLUE benchmark are based on BERT. I, too, have benefited from BERT, since I released a library for topic modeling and some HuggingFace … [Read more...] about Can Too Much BERT Be Bad for You?
Fine-tune Transformers in PyTorch Using Hugging Face Transformers
This notebook is designed to use a pretrained transformers model and fine-tune it on a classification task. The focus of this tutorial will be on the code itself and how to adjust it to your needs. This notebook is using the AutoClasses from transformer by Hugging Face functionality. This functionality can guess a model’s configuration, tokenizer and … [Read more...] about Fine-tune Transformers in PyTorch Using Hugging Face Transformers