5.9 KiB
5.9 KiB
Gemini Code Assistant Context
Wichtige Hinweise
- Projektdokumentation: Die primäre und umfassendste Dokumentation für dieses Projekt befindet sich in der Datei
readme.md. Bitte ziehen Sie diese Datei für ein detailliertes Verständnis der Architektur und der einzelnen Module zu Rate. - Git-Repository: Dieses Projekt wird über ein Git-Repository verwaltet. Alle Änderungen am Code werden versioniert. Beachten Sie den Abschnitt "Git Workflow & Conventions" für unsere Arbeitsregeln.
Project Overview
This project is a Python-based system for automated company data enrichment and lead generation. It uses a variety of data sources, including web scraping, Wikipedia, and the OpenAI API, to enrich company data from a CRM system. The project is designed to run in a Docker container and can be controlled via a Flask API.
The system is modular and consists of the following key components:
brancheneinstufung_167.py: The core module for data enrichment, including web scraping, Wikipedia lookups, and AI-based analysis.company_deduplicator.py: A module for intelligent duplicate checking, both for external lists and internal CRM data.generate_marketing_text.py: An engine for creating personalized marketing texts.app.py: A Flask application that provides an API to run the different modules.
Git Workflow & Conventions
- Commit-Nachrichten: Commits sollen einem klaren Format folgen:
- Titel: Eine prägnante Zusammenfassung unter 100 Zeichen.
- Beschreibung: Detaillierte Änderungen als Liste mit
-am Zeilenanfang (keine Bulletpoints).
- Datei-Umbenennungen: Um die Git-Historie einer Datei zu erhalten, muss sie zwingend mit
git mv alter_name.py neuer_name.pyumbenannt werden. - Commit & Push Prozess: Änderungen werden zuerst lokal committet. Das Pushen auf den Remote-Server erfolgt erst nach expliziter Bestätigung durch Sie.
- Anzeige der Historie: Web-Oberflächen wie Gitea zeigen die Historie einer umbenannten Datei möglicherweise nicht vollständig an. Die korrekte und vollständige Historie kann auf der Kommandozeile mit
git log --follow <dateiname>eingesehen werden.
Building and Running
The project is designed to be run in a Docker container. The Dockerfile contains the instructions to build the container.
To build the Docker container:
docker build -t company-enrichment .
To run the Docker container:
docker run -p 8080:8080 company-enrichment
The application will be available at http://localhost:8080.
Development Conventions
- Configuration: The project uses a
config.pyfile to manage configuration settings. - Dependencies: Python dependencies are listed in the
requirements.txtfile. - Modularity: The code is modular and well-structured, with helper functions and classes to handle specific tasks.
- API: The Flask application in
app.pyprovides an API to interact with the system. - Logging: The project uses the
loggingmodule to log information and errors. - Error Handling: The
readme.mdindicates a critical error related to theopenailibrary. The next step is to downgrade the library to a compatible version.
Current Status (Jan 05, 2026) - GTM & Market Intel Fixes
-
GTM Architect (v2.4) - UI/UX Refinement:
- Corporate Design Integration: A central, customizable
CORPORATE_DESIGN_PROMPTwas introduced inconfig.pyto ensure all generated images strictly follow a "clean, professional, photorealistic" B2B style, avoiding comic aesthetics. - Aspect Ratio Control: Implemented user-selectable aspect ratios (16:9, 9:16, 1:1, 4:3) in the frontend (Phase 6), passing through to the Google Imagen/Gemini 2.5 API.
- Frontend Fix: Resolved a double-declaration bug in
App.tsxthat prevented the build.
- Corporate Design Integration: A central, customizable
-
Market Intelligence Tool (v1.2) - Backend Hardening:
- "Failed to fetch" Resolved: Fixed a critical Nginx routing issue by forcing the frontend to use relative API paths (
./api) instead of absolute ports, ensuring requests correctly pass through the reverse proxy in Docker. - JSON Stability: The Python Orchestrator and Node.js bridge were hardened against invalid JSON output. The system now robustly handles stdout noise and logs full raw output to
/app/Log/server_dump.txtin case of errors. - Language Support: Implemented a
--languageflag. The tool now correctly respects the frontend language selection (defaulting to German) and forces the LLM to output German text for signals, ICPs, and outreach campaigns. - Logging: Fixed log volume mounting paths to ensure debug logs are persisted and accessible.
- "Failed to fetch" Resolved: Fixed a critical Nginx routing issue by forcing the frontend to use relative API paths (
Current Status (Jan 2026) - GTM Architect & Core Updates
- GTM Architect (v2.2) - FULLY OPERATIONAL:
- Image Generation Fixed: Successfully implemented a hybrid image generation pipeline.
- Text-to-Image: Uses
imagen-4.0-generate-001for generic scenes. - Image-to-Image: Uses
gemini-2.5-flash-imagewith reference image upload for product-consistent visuals. - Prompt Engineering: Strict prompts ensure the product design remains unaltered.
- Text-to-Image: Uses
- Library Upgrade: Migrated core AI logic to
google-genai(v1.x) to resolve deprecation warnings and access newer models.Pillowadded for image processing. - Model Update: Switched text generation to
gemini-2.0-flashdue to regional unavailability of 1.5. - Frontend Stability: Fixed a critical React crash in Phase 3 by handling object-based role descriptions robustly.
- Infrastructure: Updated Docker configurations (
gtm-architect/requirements.txt) to support new dependencies.
- Image Generation Fixed: Successfully implemented a hybrid image generation pipeline.
Next Steps
- Monitor Logs: Check
Log_from_docker/for detailed execution traces of the GTM Architect. - Feedback Loop: Verify the quality of the generated GTM strategies and adjust prompts in
gtm_architect_orchestrator.pyif necessary.