🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library
-
Updated
May 15, 2025 - Jupyter Notebook
🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library
ChatGPT PROMPTs Splitter. Tool for safely process chunks of up to 15,000 characters per request
Fully neural approach for text chunking
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
An agent with human in the loop that can search the web for information while bypassing bot detection for private sites.
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
We compared LangChain, Fixie, and Marvin
In this we implements a Retrieval-Augmented Generation (RAG) based conversational AI agent designed for intelligent knowledge extraction from PDF documents. Leveraging LangChain and Google’s Gemini LLM
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Spring Boot applications
Generative AI projetc using LangChain for similarity search. Input 3 articles urls and ask something about the topic
An exploration of text splitting and chunking in JavaScript
Leveraging Langchain for a RAG (Retriever Augmented Generation) project, this implementation enables efficient querying across multiple books, enhancing data retrieval and natural language generation for context-rich answers.
A lightweight TypeScript text splitter for RAG applications
This repository covers all the code materials covered within Jose Portilla's Langchain with Python Bootcamp on Udemy.
Allows you to upload to GitHub text files over 100MB
Script TCL pour EGGDROP sur IRC, permettant la division de textes en blocs selon une longueur spécifiée. Il respecte les codes de formatage IRC et facilite la gestion et la manipulation des messages IRC.
Successfully developed an LLM application which generates a summary, a list of citations and references and response to a user's query based on the research paper's content.
Text splitting example using Tiktoken
Add a description, image, and links to the text-splitter topic page so that developers can more easily learn about it.
To associate your repository with the text-splitter topic, visit your repo's landing page and select "manage topics."