Text Similarity

Local Sensitive Hashing (LSH)— Complete Guide & How To Get Started In Python

What is local sensitive hashing? A technique for performing a rough nearest neighbour search in high-dimensional spaces is called local…

3 years ago

MinHash — How To Deal With Finding Similarity At Scale With Python Code To Get Started

What is MinHash? MinHash is a technique for estimating the similarity between two sets. It was first introduced in information…

3 years ago

SimHash — The Ultimate Guide And How To Get Started Guide In Python

What is SimHash? Simhash is a technique for generating a fixed-length "fingerprint" or "hash" of a variable-length input, such as…

3 years ago

How To Implement Fuzzy String Matching [4 Ways In Python]

This article discusses one of the most valuable tools when analysing textual data in natural language processing — fuzzy string…

3 years ago

Top 7 Ways To Implement Document & Text Similarity In Python: NLTK, Scikit-learn, BERT, RoBERTa, FastText and PyTorch

Text similarity is a really useful natural language processing (NLP) tool. It allows you to find similar pieces of text…

3 years ago