🏩 JudgerAI — Your Dream Legal AI Assistant

Table of Contents:

🚀 Overview
📺 Demo
⚙️ Installation
📊 Dataset
🧹 Project Modules
🤖 AI Models
🧪 Experiments & Evaluation
🏅 Training Approach
📊 Ensemble Voting
📚 Notebooks & Further Reading

🚀 Overview

JudgerAI is an NLP application that predicts legal case outcomes by analyzing past cases, precedents, and case facts. It helps legal professionals to:

📈 Increase prediction accuracy
⏱️ Save valuable time on case research
🧠 Make informed, data-driven decisions

📺 Demo

Watch JudgerAI in action:

JudgerAI.2.0.-.Project.Demo.-.Trim.mp4

⚙️ Installation

1️⃣ Clone the repository

git clone https://github.com/MohammedAly22/JudgerAI.git

2️⃣ Download the GloVe embeddings (50-dim) from Kaggle and save them as:

./GloVe/glove.6B.50d.txt

3️⃣ Download the pre-trained models (large files) via the "Download Models from Here" link and place them in models/.

4️⃣ Directory structure:

JudgerAI/
├── csvs/
├── dataset/
├── GloVe/
├── models/
├── src/
└── *.ipynb

5️⃣ Run the app:

streamlit run src/main.py
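
As a quick check for step 2, the GloVe text format stores one token per line followed by its vector components, separated by spaces. The sketch below parses that format; `load_glove` and its `expected_dim` parameter are hypothetical names for illustration and are not taken from the JudgerAI source.

```python
import io

def load_glove(handle, expected_dim=50):
    """Parse GloVe text vectors: one token per line followed by its floats."""
    embeddings = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) != expected_dim + 1:
            continue  # skip blank or malformed lines
        embeddings[parts[0]] = [float(x) for x in parts[1:]]
    return embeddings

# Tiny in-memory stand-in with 3-dimensional vectors, for illustration only.
sample = io.StringIO("court 0.1 0.2 0.3\njudge 0.4 0.5 0.6\n")
vectors = load_glove(sample, expected_dim=3)
print(sorted(vectors), len(vectors["court"]))
```

With the real file, the same function could be called on `open("./GloVe/glove.6B.50d.txt", encoding="utf-8")` with the default `expected_dim=50`.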

📊 Dataset

Total cases: 3,464
Key columns: ID, name, href, first/second_party, winning_party, winner_index (0/1), facts
Input: facts → Output: winner_index

Here is the dataset summary:

column	datatype	description
`ID`	int64	Defines the case ID
`name`	string	Defines the case name
`href`	string	Defines the case hyper-reference
`first_party`	string	Defines the name of the first party (petitioner) of a case
`second_party`	string	Defines the name of the second party (respondent) of a case
`winning_party`	string	Defines the winning party name of a case
`winner_index`	int64	Defines the winning index of a case, 0 => the first party wins, 1 => the second party wins
`facts`	string	Contains the case facts that are needed to determine who is the winner of a specific case
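
A minimal sketch of how the input/output pairs could be pulled from a CSV with the columns above. The helper name `read_cases` and the two sample rows are made up for illustration; the actual file name inside dataset/ is not stated here, so an in-memory sample stands in for it.

```python
import csv
import io

def read_cases(handle):
    """Return (facts, winner_index) pairs from a CSV with the columns above."""
    reader = csv.DictReader(handle)
    return [(row["facts"], int(row["winner_index"])) for row in reader]

# Two invented rows, used only to show the expected shape of the data.
sample_csv = io.StringIO(
    "ID,name,href,first_party,second_party,winning_party,winner_index,facts\n"
    '1,A v. B,#,A,B,A,0,"A sued B over a contract."\n'
    '2,C v. D,#,C,D,D,1,"C appealed a ruling in favor of D."\n'
)
pairs = read_cases(sample_csv)
texts = [facts for facts, _ in pairs]    # model input
labels = [label for _, label in pairs]   # model target (0 or 1)
```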

🧹 Project Modules

Modular structure for maintainability and clarity:

Module	Location	Responsibilities
Preprocessing	`src/preprocessing.py`	Tokenization, balancing, anonymization, vectorization
Plotting	`src/plotting.py`	Visualizing performance, confusion matrices, ROC-AUC, heatmaps
Utils	`src/utils.py`	Training helpers, k-fold CV, accuracy/loss summary builders
Streamlit App	`src/main.py`	Frontend UI for demo and deployment
Deployment Utils	`src/deployment_utils.py`	Model loader, sample picker, vectorizer generator, highlights words

🤖 AI Models

JudgerAI incorporates 7 different models:

Doc2Vec – Documents as dense vectors
1D-CNN – Convolutional features over text
TF-IDF + TextVectorization – Weighted bag-of-words embedding
GloVe – Global co-occurrence embeddings
FastText – Subword-enhanced embeddings
LSTM – Memory-capable sequences
BERT – Contextual pre-trained transformer

A mix of traditional and modern architectures to maximize coverage.

🧪 Experiments & Evaluation

Three core preprocessing decisions were evaluated:

Preprocessing steps – stopword removal, stemming, etc.
Data anonymization – replacing party names with _PARTY_
Label imbalance – balancing the two winner_index classes before training
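
The anonymization decision can be illustrated with a short sketch that swaps party names for the `_PARTY_` placeholder. This is one possible approach, assuming whole-name, case-insensitive matching; the function name `anonymize` is hypothetical and the project's own implementation in src/preprocessing.py may differ.

```python
import re

def anonymize(facts, first_party, second_party, placeholder="_PARTY_"):
    """Replace whole-name, case-insensitive party mentions with a placeholder."""
    # Longer names first, so a name containing the other is matched in full.
    names = sorted({first_party, second_party}, key=len, reverse=True)
    alternatives = [
        r"(?<!\w)" + re.escape(name) + r"(?!\w)" for name in names if name
    ]
    if not alternatives:
        return facts
    return re.sub("|".join(alternatives), placeholder, facts, flags=re.IGNORECASE)

text = "Smith sued Acme Corp. after Smith was fired."
print(anonymize(text, "Smith", "Acme Corp."))
```

The lookarounds keep a name such as "Smith" from being replaced inside a longer word like "Smithson".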

Switching each of these three decisions on or off gives 2³ = 8 experiments per model. Each experiment is run with 4-fold cross-validation, for a total of 8 × 4 = 32 training runs per model.

Combination #	Preprocessing	Data Anonymization	Label Class Imbalance
1	No	No	No
2	No	No	Yes
3	No	Yes	No
4	No	Yes	Yes
5	Yes	No	No
6	Yes	No	Yes
7	Yes	Yes	No
8	Yes	Yes	Yes
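
The table above can be enumerated programmatically. The sketch below generates the eight on/off combinations in the same order as the table; the flag names in `FLAGS` are illustrative labels, not identifiers from the JudgerAI code.

```python
from itertools import product

# Illustrative flag names, one per column of the table above.
FLAGS = ("preprocessing", "anonymization", "imbalance_handling")

combinations = [
    dict(zip(FLAGS, values))
    for values in product((False, True), repeat=len(FLAGS))
]

for number, combo in enumerate(combinations, start=1):
    print(number, combo)

cv_folds = 4
total_runs_per_model = len(combinations) * cv_folds
```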

🏅 Training Approach

80/20 train/test split
4-fold CV on training set
Best combination selected per model based on accuracy

Workflow:

Train each model × 8 preprocessing setups × 4 CV folds
Evaluate each setup and keep the best-performing combination for each model
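
The split and cross-validation scheme above can be sketched with plain index lists. This is an illustration under assumptions: the helper names, the fixed seed, and the interleaved fold assignment are choices made here, and the project's own helpers in src/utils.py may work differently.

```python
import random

def train_test_split_indices(n, test_fraction=0.2, seed=0):
    """Shuffle range(n) and return (train_indices, test_indices)."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    n_test = round(n * test_fraction)
    return indices[n_test:], indices[:n_test]

def k_fold(indices, k=4):
    """Yield (train, validation) index lists; each fold validates exactly once."""
    folds = [indices[i::k] for i in range(k)]
    for i, validation in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation

# 3,464 is the dataset size reported in the Dataset section.
train_idx, test_idx = train_test_split_indices(3464)
splits = list(k_fold(train_idx, k=4))
print(len(train_idx), len(test_idx), [len(val) for _, val in splits])
```

The held-out 20% is never touched during cross-validation, so it remains available for a final evaluation of the selected setup.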

📊 Ensemble Voting

Final predictions are generated by ensemble voting across all tuned models, so that no single model decides the outcome on its own.
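
The voting scheme is not specified in detail here, so the following is a sketch assuming a hard majority vote over each model's predicted winner_index. The function name `majority_vote` and the example predictions are invented for illustration.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the most common label among per-model predictions for one case."""
    label, _ = Counter(predictions).most_common(1)[0]
    return label

# Invented predictions from the seven models for a single case.
votes = {
    "doc2vec": 0, "cnn": 1, "tf_idf": 1, "glove": 0,
    "fasttext": 1, "lstm": 1, "bert": 0,
}
final_prediction = majority_vote(list(votes.values()))
print(final_prediction)
```

With seven models and two classes, a hard vote can never end in a tie. A soft vote that averages predicted probabilities would be an alternative design.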

📚 Notebooks & Further Reading

In-depth exploration available in the following notebooks:

BERT_experiments.ipynb
cnn_experiments.ipynb
doc2vec_experiments.ipynb
FastText_experiments.ipynb
glove_experiments.ipynb
LSTM_experiments.ipynb
tf_idf_experiments.ipynb
voting_experiments.ipynb

🤝 Contributing & License

Contributions to JudgerAI are welcome.

🙏 Thank You!

Thank you for exploring JudgerAI, a step toward AI-assisted legal decision-making.
