Trigger Recognition for Enjoyable and Appropriate Television - EXAONE

I was tired of getting grossed out watching unexpected scenes in movies and TV and losing my appetite, that's why I created TREAT.

The goal of this project is to empower viewers by forewarning them about potential triggers in the content they watch, making the viewing experience more enjoyable, inclusive, and appropriate for everyone.

TREAT is a web application that uses natural language processing to analyze movie and TV show scripts, identifying potential triggers to help viewers make informed choices.

Installation Instructions

Prerequisites

Star the Repository to Show Your Support :P

Clone the Repository to Your Local Machine:

git clone https://github.com/Kuberwastaken/TREAT-EXAONE.git

Environment Setup

To set up the development environment, you will need to create a virtual environment and install the necessary dependencies.

Create a Virtual Environment:
```
python3 -m venv TREAT-EXAONE
```

Activate the Virtual Environment:

source treat-env/bin/activate   # On Unix or MacOS
treat-env\Scripts\activate      # On Windows

Install Dependencies:

Navigate to the project directory and run:
```
pip install -r requirements.txt
```

Project Usage

Start the Flask Server:
```
python run.py
```
Open Your Browser:

Navigate to http://127.0.0.1:5000 to access the TREAT web interface.
Analyze Scripts:

You can manually enter a script in the provided text area and click "Analyze Script."

File Descriptions

app.py: The main Flask application file that handles routing.
app/routes.py: Contains the Flask routes for handling script uploads.
app/model.py: Includes the script analysis functions using the LGAI EXAONE 2.4B model.
templates/index.html: The main HTML file for the web interface.
static/css/style.css: Custom CSS for styling the web interface.
static/js/app.js: JavaScript for handling client-side interactions.

Types of Triggers Detected

The TREAT application focuses on identifying a variety of potential triggers in scripts, including but not limited to:

Violence: Scenes of physical aggression or harm.
Self-Harm: Depictions of self-inflicted injury.
Death: Depictions of death or dying characters.
Sexual Content: Any depiction or mention of sexual activity, intimacy, or behavior.
Sexual Abuse: Instances of sexual violence or exploitation.
Gun Use: Depictions of firearms and their usage.
Gore: Graphic depiction of injury, blood, or dismemberment.
Vomit: Depictions of vomiting or nausea-inducing content.
Mental Health Issues: Depictions of mental health struggles, including anxiety, depression, or disorders.
Animal Cruelty: Depictions of harm or abuse towards animals.

These categories help address a very real-world problem by forewarning viewers about potentially distressing content, enhancing their viewing experience.

Adding new categories is as simple as specifying a new category under model.py and utils.py

Design Choices

Inspiration: I aimed for a simple and intuitive user experience, focusing on simplicity and ease of use. This decision stemmed from the need to create a tool that is easy to navigate for all users, regardless of background or age.
Theme and Color Scheme: The chosen theme and color scheme create a visually appealing and engaging environment. The chocolate and sweets theme is intended to stick to the TREAT theme and make the experience enjoyable and pleasant.
Script Analysis: The LGAI EXAONE 2.4B model by LG AI Research was chosen for its efficiency and performance in text analysis tasks. As part of the EXAONE 3.5 series, it offers state-of-the-art performance for its size, with support for both English and Korean languages. The model is optimized for small devices while maintaining high accuracy, making it accessible for users with limited hardware. It can handle long contexts up to 32K tokens, improving script analysis for larger texts.

Switching Between Models

TREAT is designed to be flexible with different language models. You can easily switch between models by editing the model.py file:

Open app/model.py
Locate the model initialization section (typically at the beginning of the file)

Change the model name from:

model_name = "LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct"

To any other compatible model, such as:

model_name = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"  # For DeepSeek-R1 1.5B

or:

model_name = "LGAI-EXAONE/EXAONE-3.5-7.8B-Instruct"  # For larger EXAONE model

Adjust the prompt template if necessary to match the expected format of the new model.
Remember to adjust parameters like max_chunk_size based on the capabilities of the selected model.

Model Configuration Guide

Core Configuration Parameters ⚙️

1. Text Chunking Settings

# Located in analyze_script()
max_chunk_size = 1024  # Text segment length (tokens)
overlap = 128           # Context preservation between chunks

Recommended Adjustments:

For long dialogues:
```
max_chunk_size = 1536
overlap = 256
```
For action-heavy scripts:
```
max_chunk_size = 768
overlap = 64
```

2. Generation Controls

# In model.generate() parameters
{
    "temperature": 0.2,          # Range: 0.1 (strict) - 9.0 (creative)
    "top_p": 0.9,                # Range: 0.8 (focused) - 1.0 (diverse)
    "repetition_penalty": 1.05   # Range: 1.0 (none) - 2.0 (strict)
}

Use Case Examples:

For sensitive content analysis:

{"temperature": 0.1, "top_p": 0.8, "repetition_penalty": 1.2}

For creative interpretation:

{"temperature": 0.7, "top_p": 0.95, "repetition_penalty": 1.0}

3. Prompt Structure for LGAI EXAONE 2.4B

messages = [
    {"role": "system", "content": "You are a helpful assistant specialized in analyzing scripts for trigger content."},
    {"role": "user", "content": f"""TEXT ANALYSIS:
Respond ONLY with this exact format:

VIOLENCE: [YES/NO]
...
MENTAL_HEALTH: [YES/NO]

Text: {chunk[:768]}..."""}
]

Customization Guide:

Maintain Respond ONLY... directive.
Keep category list order consistent.
Preserve ... after the text preview.
Adjust the system message to refine the model's behavior if needed.

4. Response Parsing Logic

# Category normalization
category_map = {
    cat.upper().replace("_", " "): cat 
    for cat in expected_order
}

# Answer recognition pattern
pattern = r"\b({})\b\s*[:=]\s*\[?(YES|NO|MAYBE|Y|N|M)\]?".format(
    "|".join(re.escape(cat) for cat in category_map.keys())
)

Configuration Reference Table 📋

Parameter	Location	Default	Effect Range
`max_chunk_size`	analyze_script()	1024	512-2048
`overlap`	analyze_script()	128	32-256
`temperature`	model.generate()	0.2	0.1-1.0
`top_p`	model.generate()	0.9	0.7-1.0
`repetition_penalty`	model.generate()	1.05	1.0-2.0

Advanced Customization 🛠️

Adding New Categories

Add to expected_order list:

expected_order = [
    ...,
    "NEW_CATEGORY"
]

Update the prompt template format section.

Test parsing with:

# Test pattern with new category
test_text = "NEW_CATEGORY: YES"
assert "NEW_CATEGORY" in extract_answers(test_text, expected_order)

Modifying Response Format

Edit the regex pattern for different answer formats:

# Example: Allow 'Y'/'N' shorthand
pattern = r"\b({})\b\s*[:=]\s*\[?(Y|N)\]?".format(...)

Optimizing Performance

Reduce VRAM usage:
```
max_chunk_size = 768
overlap = 64
```
Faster processing (less accurate):
```
{"temperature": 0.1, "top_p": 0.8}
```

To-Do List

Fixing the model working but not the output
Parallel Processing of Multiple chunks
Optimizing LGAI EXAONE 2.4B model parameters for better trigger detection
Adding support for additional languages using the model's bilingual capabilities

Acknowledgements

I would like to thank:

LG AI Research: For developing and providing open access to the LGAI EXAONE models, a very critical component of this project.
Parasite (2019): For that unexpected jumpscare that ruined my appetite and ultimately inspired this project.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
static		static
templates		templates
test_Files		test_Files
treat/app		treat/app
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Trigger Recognition for Enjoyable and Appropriate Television - EXAONE

Installation Instructions

Prerequisites

Environment Setup

Project Usage

File Descriptions

Types of Triggers Detected

Design Choices

Switching Between Models

Model Configuration Guide

Core Configuration Parameters ⚙️

1. Text Chunking Settings

Recommended Adjustments:

2. Generation Controls

Use Case Examples:

3. Prompt Structure for LGAI EXAONE 2.4B

Customization Guide:

4. Response Parsing Logic

Configuration Reference Table 📋

Advanced Customization 🛠️

Adding New Categories

Modifying Response Format

Optimizing Performance

To-Do List

Acknowledgements

About

Uh oh!

Uh oh!

Languages

Kuberwastaken/TREAT-EXAONE

Folders and files

Latest commit

History

Repository files navigation

Trigger Recognition for Enjoyable and Appropriate Television - EXAONE

Installation Instructions

Prerequisites

Environment Setup

Project Usage

File Descriptions

Types of Triggers Detected

Design Choices

Switching Between Models

Model Configuration Guide

Core Configuration Parameters ⚙️

1. Text Chunking Settings

Recommended Adjustments:

2. Generation Controls

Use Case Examples:

3. Prompt Structure for LGAI EXAONE 2.4B

Customization Guide:

4. Response Parsing Logic

Configuration Reference Table 📋

Advanced Customization 🛠️

Adding New Categories

Modifying Response Format

Optimizing Performance

To-Do List

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages