Aletheia: Automated AI safety testing platform
Building an automated AI saftey testing platform
For the next few months, I will be working Aletheia, a comprehensive AI safety platform that automatically tests, monitors, and evaluates AI models for alignment issues across multiple dimensions: truthfulness, helpfulness, harmlessness, and value alignment with continuous safety monitoring, red-teaming automation, and interpretability insights to ensure responsible AI deployment at scale.
I’m planning to utilize FastAPI and Python for the backend to call AI models with Together AI API calls, and React for a frontend web development.
I envision users selecting an AI model of their choice where the platform will automatically conduct safety tests (including jailbreaks, red-teaming, etc) to produce a comprehensive report on the model’s safety.
Development
Today, I started out with installing relevant dependancies for Together AI API calls and building a simple technical architecture
pip3 install together openai requests