Encoder, decoder and encoder-decoder transformers are a type of neural network currently at the bleeding edge in NLP. This article explains the difference between these architectures and what they...
Encoder, decoder and encoder-decoder transformers are a type of neural network currently at the bleeding edge in NLP. This article explains the difference between these architectures and what they...
What is a Hidden Markov Model in NLP? A time series of observations, such as a Hidden Markov Model (HMM), can be represented statistically as a probabilistic model. Natural language processing (NLP)...
What is deep learning for natural language processing? Deep learning is a part of machine learning based on how the brain works, especially the neural networks that make up the brain. It requires...
Neural machine translation (NMT) is a state-of-the-art technique for translation. Our previous article on translating text in Python covered the two most common ways of getting started with...
Transfer learning is explained, and the advantages and disadvantages are summed up. Types of transfer learning in NLP are summed up, and a list of the top models commonly used for transfer learning...
What is MinHash? MinHash is a technique for estimating the similarity between two sets. It was first introduced in information retrieval to evaluate the similarity between documents quickly. The...
What is SimHash? Simhash is a technique for generating a fixed-length "fingerprint" or "hash" of a variable-length input, such as a document or a piece of text. It is similar to a hash function and...
This article discusses one of the most valuable tools when analysing textual data in natural language processing — fuzzy string matching. We first discuss what it is, its typical applications and...
Abstractive text summarization is a valuable tool in Python when working with large documents, or you quickly want to summarize data. In this article, we discuss applications of abstractive text...
This list covers the top 7 machine learning algorithms and 8 deep learning algorithms used for NLP. If you are new to using machine learning algorithms for NLP, we suggest starting with the first...
This article covers reinforcement learning and its application in natural language processing (NLP). It also covered the latest developments in the field, a discussion on whether you should start...
This is a complete guide on utilising NLTK to build a whole preprocessing pipeline. Take the time to read through the different components so you know how to start building your pipeline. What is an...
In this guide, we cover how to start with the bag-of-words technique in Python. We first cover what a bag-of-words approach is and provide an example. We then cover the advantages and disadvantages...
Text classification is an important natural language processing (NLP) technique that allows us to turn unstructured data into structured data; many different algorithms allow you to do this, and so...
Text similarity is a really useful natural language processing (NLP) tool. It allows you to find similar pieces of text and has many real-world use cases. This article discusses text similarity, its...
What is text generation in NLP? Text generation is a subfield of natural language processing (NLP) that deals with generating text automatically. It has a wide range of applications, including...
This guide covers how to translate text in Python. Machine translation is a prominent natural language processing (NLP) application that is not very straightforward. We start by covering what is...
What is sentence embedding? Sentence embedding is a technique for representing a natural language sentence as a fixed-length numerical vector. The goal is to encode the semantic meaning and content...
Get a FREE PDF with expert predictions for 2026. How will natural language processing (NLP) impact businesses? What can we expect from the state-of-the-art models?
Find out this and more by subscribing* to our NLP newsletter.