Aletheia: Automated AI safety testing platform

less than 1 minute read

Building an automated AI saftey testing platform

For the next few months, I will be working Aletheia, a comprehensive AI safety platform that automatically tests, monitors, and evaluates AI models for alignment issues across multiple dimensions: truthfulness, helpfulness, harmlessness, and value alignment with continuous safety monitoring, red-teaming automation, and interpretability insights to ensure responsible AI deployment at scale.

I’m planning to utilize FastAPI and Python for the backend to call AI models with Together AI API calls, and React for a frontend web development.

I envision users selecting an AI model of their choice where the platform will automatically conduct safety tests (including jailbreaks, red-teaming, etc) to produce a comprehensive report on the model’s safety.

Development

Today, I started out with installing relevant dependancies for Together AI API calls and building a simple technical architecture

pip3 install together openai requests

Screenshot 2025-08-18 at 12 11 41 PM

Share on

X Facebook LinkedIn Bluesky

Nathan Baek

Aletheia: Automated AI safety testing platform

Building an automated AI saftey testing platform

Development

Share on

You May Also Enjoy

LLM Fine-tuning: Day 8 (END)

LLM Fine-tuning: Day 7

LLM Fine-tuning: Day 6

TPS Website