Collaborative AI Projects in Research and Application
Responsible: Gianni Franchi (ENSTA Paris / U2IS)
Teaching Assistants: Project tutors and PhD mentors depending on the topic
Assessment: Code quality (40%), Written report (30%), Final presentation (30%)
The CSC_5IA21 – Project IA course is a group-based project designed to immerse students in real-world artificial intelligence challenges. Students form teams of 4 to 6 members and select one of the proposed research-oriented projects. Each project focuses on applying and extending AI methods across domains such as deep learning, explainability, generative models, multimodal fusion, and trustworthy AI.
The objective of this course is to help students develop research autonomy, teamwork skills, and an applied understanding of modern AI methods while working under the supervision of expert tutors. The project outcome includes code implementation, a written report, and a final oral presentation.
P1 — Deep Fakes Image Detection: Develop and benchmark deep neural models capable of detecting manipulated or generated images from real ones.
P2 — Deep Fakes Sound & Multimodal Generation: Explore multimodal deepfake generation and detection by combining audio and video modalities.
P3 — Sound Unmixing: Implement and evaluate machine learning methods for separating overlapping sound sources. (Tutor: Mathieu F.)
P4 — Explainable AI & Mechanistic Interpretability: Analyze internal mechanisms of neural networks to improve interpretability and understanding of learned representations.
P5 — Physics-AI and Uncertainty: Combine physics-based modeling and machine learning, focusing on uncertainty estimation and trustworthiness of predictions.
P6 — Semantic Segmentation in Open-World Datasets: Train segmentation models on complex real-world datasets and address domain shift and OOD generalization.
P7 — LLMs and Hallucination: Study hallucination behaviors in Large Language Models and evaluate statistical or structural mitigation strategies.
P8 — Synthetic Data Generation for RAG Evaluation: Build synthetic datasets to evaluate Retrieval-Augmented Generation (RAG) systems and analyze their robustness.
P9 — Anomaly Detection in Microscopic Images: Develop image anomaly detection techniques for defect identification in microscopic data.
P10 — Analysis of Deep Learning Architectures and Training Dynamics: Examine how architecture choices and optimization parameters affect model behavior and performance.
P11 — Embodied AI, LVLMs & Vision-Language-Action Models: Explore the integration of large multimodal models in embodied robotics and action understanding tasks.
P12 — Efficient Image Generation: Investigate advanced optimization techniques such as LoRA, pruning, quantization, and ControlNet for faster and energy-efficient image generation.
Total: 10–11 sessions across the semester. Schedule meetings in advance to ensure tutor availability and consistency. Each group must document progress and decisions after every session using the Discord project channel, following the template shared in class.