Fuzzy string matching, grouping, and evaluation.
-
Updated
May 21, 2024 - Python
Fuzzy string matching, grouping, and evaluation.
Machine learning movie recommending system
Text2Text: Crosslingual NLP/G toolkit
Text vectorization tool to outperform TFIDF for classification tasks
several methods for text classification
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
A Python Search Engine for Humans 🥸
Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.
Arabic Open Domain Question Answering System using Neural Reading Comprehension
中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法
Social Analysis based on Whatsapp data
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
It is a content based recommender system that uses tf-idf and cosine similarity for N Most SImilar Items from a dataset
Add a description, image, and links to the tf-idf topic page so that developers can more easily learn about it.
To associate your repository with the tf-idf topic, visit your repo's landing page and select "manage topics."