A small, fast, local-first, searchable index for client side apps written in Typescript. Supports required, negated, and phrase queries.
-
Updated
Jun 12, 2024 - TypeScript
A small, fast, local-first, searchable index for client side apps written in Typescript. Supports required, negated, and phrase queries.
An internet search engine written mostly in python. Currently TF-IDF based.
An Apache Spark application to analyze word frequencies and compute TF-IDF weights across multiple text file sets using Spark's MLlib library.
Search anything, instantly
This repository is about my semester project related to the course Big Data Analytics. I developed a recommendation system engine based on Hotel Reviews Dataset. Read more about the implementation in markdown file.
A platform enables sharing diverse knowledge, but similarly worded questions are common. We use NLP techniques to identify duplicate questions, enhancing user experience by making it easier to find high-quality answers.
An NLP based APP includes features for spam detection, sentiment analysis, stress detection, hate and offensive content detection, and sarcasm detection. It leverages Natural Language Processing (NLP) techniques and machine learning models to analyze and classify text inputs. Table of Contents
This project involves developing a machine learning model to predict user preferences in chatbot conversations, using a dataset of head-to-head responses from various large language models. The goal is to enhance chatbot-human interactions by aligning chatbot responses more closely with human preferences.
This is a Python tool for Streamlit that automates redirect mappings during site migrations by matching URLs from an old site to a new site based on content similarity.
A utility library for comparing strings via Cosine Similarity
Legal case retrieval challenge. Solution based on similarity search and learning-to-rank methods
An ML-based project designed to accurately classify email messages as either spam or ham (non-spam)
This repository houses a comprehensive Machine Learning project aimed at classifying Yelp reviews using Multinomial Naive Bayes and Natural Language Processing (NLP) techniques.
Apply ensemble technique of model stacking to predict patient's readmission
Slides, exercises, and exams for my course "Natural Language Processing" (École Pour l'Informatique et les Techniques Avancées, 2024)
Artificial Intelligence Laboratory (6th semester) course's project.
Simple chatbot (NLP ONLY without machine learning) using Levenshtein Distance + TF-IDF + Cosine Similiarity :D
A custom search engine built with Rust. It parses HTML files and utilizes TF-IDF scoring to rank document relevance based on search queries. The project includes a Rust-based backend server and vanilla HTML/CSS for the web frontend.
Add a description, image, and links to the tf-idf topic page so that developers can more easily learn about it.
To associate your repository with the tf-idf topic, visit your repo's landing page and select "manage topics."