Technical Pipeline / Project Architecture =========================================== .. raw:: html
The overall process for implementing the project is structured in the following pipeline:
1. Global Project Architecture --------------------------- .. figure:: /Documentation/images/arch.PNG :width: 100% :alt: Alternative text for the image :name: logo 1. Data Collection & Preprocessing _________________________________ .. figure:: /Documentation/images/dc.PNG :width: 100% :alt: Alternative text for the image :name: logo .. raw:: htmlExtract data from existing sources such as PDF files, Excel spreadsheets, technical documents, and internal databases.
Clean, structure, and normalize textual data to make it usable by AI models.
Tools used may include PyMuPDF, Pandas, OpenRefine, etc.
2. Knowledge Base Creation ___________________________ .. raw:: htmlOrganize the extracted data into a Knowledge Base.
Structure the information by component, requirement, test type, and associated documentation.
Possible technologies include PostgreSQL, MongoDB, and vector databases like FAISS or Weaviate.
3. Benchmarking & Integration of LLM / RAG Models _______________________________________________ .. figure:: /Documentation/images/llm.PNG :width: 100% :alt: Alternative text for the image :name: logo .. raw:: htmlSelect and experiment with different language models (LLMs) such as GPT-4, Mistral, LLaMA, etc.
Integrate a Retrieval-Augmented Generation (RAG)architecture that combines knowledge base search with text generation.
Evaluate performance, accuracy of responses, and domain adaptation for the automotive context.
4. Automatic Document Generation __________________________________ .. figure:: /Documentation/images/docgen.PNG :width: 100% :alt: Alternative text for the image :name: logo .. raw:: htmlUse generative AI models to automatically produce:
- Technical specifications - Test plans - Validation reports .. raw:: htmlSupported formats: PDF, DOCX, JSON.
Integration options: LangChain, LlamaIndex, or HuggingFace Transformers.
5. Development of the Web Platform __________________________________ .. figure:: /Documentation/images/platf.PNG :width: 100% :alt: Alternative text for the image :name: logo .. raw:: html- Create a user-friendly interface allowing:
- Input of the component name - Submission of questions - Download of generated documents .. raw:: html- Proposed technologies include: React.js or Vue.js (frontend), FastAPI or Django (backend), with Docker used for deployment.
2. Key Application Features -------------------------- .. raw:: html| Feature | Description |
|---|---|
| Intelligent Search | Find requirements based on the component name. |
| AI Q&A Agent | Answer questions regarding the component's technical details. |
| Document Generation | Generate test sheets, specifications, and validation reports. |
| Requirement Explanation | Provide in-depth explanations of requirement functions, impacts, and associated tests. |
An intelligent web platform that transforms a simple component name into a rich set of technical information and documents. This will significantly reduce the time required for analysis, documentation, and validation in the automotive domain.
Technologies & Tools _____________________ .. raw:: html| Domain | Tools / Technologies |
|---|---|
| Data Extraction | PyMuPDF, textract, Pandas |
| AI & NLP | OpenAI GPT, LangChain, HuggingFace |
| Vector Databases | FAISS, ChromaDB, Weaviate |
| Backend | FastAPI, Flask, Django |
| Frontend | React, Vue.js |
| Documentation | MkDocs, Sphinx, Notion |