Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar docs.
-
Updated
Jan 1, 2024 - HTML
Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar docs.
Extract raw R code directly from webpages, including Github, Kaggle, Stack Overflow, and sites made using Blogdown.
Display Windows 10 notifications with coronavirus data in your country.
Machine translation using the seq2seq model
Towards Data Science articles directly to your Kindle
'Towards Data Science' articles RAG system
Information retrieval system that crawls, preprocesses, and indexes articles from Towards Data Science website, featuring TF-IDF scoring, cosine similarity search, and a Django web interface for querying documents. The system implements both Boolean and vector space models with support for stemming, stopword removal, inverted index
Full code for the Towards Data Science article: Data Science Plumbing.
This repository contains jupyter notebooks of the articles written by me on medium.com.
Add a description, image, and links to the towards-data-science topic page so that developers can more easily learn about it.
To associate your repository with the towards-data-science topic, visit your repo's landing page and select "manage topics."