Best 96 Interpretability Open Source Projects

The Truth Is In There: Improving Reasoning in Language Models with Layer...

📍 Interactive Studio for Explanatory Model Analysis

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabula...

Interpretability for sequence generation models 🐛 🔍

💡 Adversarial attacks on explanations and how to defend them

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Ge...

For calculating global feature importance using Shapley values.

Layer-wise Relevance Propagation (LRP) for LSTMs.

[ECCV 2020] QAConv: Interpretable and Generalizable Person Re-Identifica...

Zennit is a high-level framework in Python using PyTorch for explaining/...

🏥 Visualizing Convolutional Networks for MRI-based Diagnosis of Alzhei...

Concept Bottleneck Models, ICML 2020

This repository introduces MentaLLaMA, the first open-source instruction...

Collection of NLP model explanations and accompanying analysis tools

A Python library for Secure and Explainable Machine Learning