Skip to content

🪟🔎 Glassbox LLMs is a GDG open-source project exploring the inner workings of large language models. We combine hands-on experiments with the latest research to decode the “black box” of modern AI

Notifications You must be signed in to change notification settings

DSC-McMaster-U/glassbox-llms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🪟 🔎 Glassbox LLMs

Welcome to the home for our exploration into Large Language Model (LLM) interpretability and explainability! 🚀

🧠 About

Ever wonder why AI says what it says? Today’s most powerful AI models are “black boxes”—even the experts can’t understand how decisions are made by the most widely used LLMs! In a world where AI impacts everything from healthcare to social media, understanding these black boxes is more important than ever for building trust, fairness, and reliable AI.

In this project, we’ll explore the current research landscape of LLM interpretability, try out some hands-on methods on real models, and dive into our own open-ended research questions.Our ultimate goal is to create and publish an open-source Python library that makes interpretability easy to add to any codebase that leverages LLMs. Together, we’ll demystify the inner workings of open weight models like Gemma and Llama, learning about the foundational black-box architecture of LLMs and how the can be made transformed into glass-boxes!

📚 What We Will Learn

  • The black-box architecture that supports modern day Large Language Models and why it can be problematic for interpretability 🕵️‍♀️
  • How to apply hands-on interpretability techniques to peek inside giant AI model: We'll take a look at sparse autoencoders, attention map analysis, feature attribution, and more 🧩
  • How to spot patterns, test model behavior, and think critically about AI decisions—skills that matter in the real world
  • How to review literature to unpack the bleeding edge research landscape 📖
  • Career-boosting know-how: These skills are hot in AI research AND industry! Stand out by showing you can both build and explain AI decision making systems 🌱

👐 Who Should Join

It's advisable for applicants to have some prior knowledge in ML concepts (like the neural network, gradient decent, etc.), however we will be covering an intro to LLM architecture in the beginning stages of this project! No advanced background required—just bring your questions and some curiosity. Everyone’s welcome!

About

🪟🔎 Glassbox LLMs is a GDG open-source project exploring the inner workings of large language models. We combine hands-on experiments with the latest research to decode the “black box” of modern AI

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 6