Commit Graph

40 Commits

Author SHA1 Message Date
Jarvis b8fde5ceb4 Fix another indentation issue in update_company_industry 2026-01-30 14:05:56 +00:00
Jarvis 97af86e509 Fix IndentationError in app.py 2026-01-30 14:04:40 +00:00
Jarvis 6ab2f10942 Fix syntax error in app.py 2026-01-30 13:58:42 +00:00
6ce3ca84eb [2f888f42] Rebuild container and test
2026-01-30 11:55:37 +00:00
b184cf1d0f [2f388f42] Implement UI changes to display pending error reports (red flag in the company list, display in the Inspector) and to allow further error reports. Backend APIs were extended accordingly.
2026-01-27 11:18:36 +00:00
5f3ff4a734 Merge branch 'main' of https://floke-gitea.duckdns.org/Floke/Brancheneinstufung2
# Conflicts:
#	company-explorer/frontend/src/components/Inspector.tsx
#	company-explorer/frontend/src/components/RoboticsSettings.tsx
2026-01-27 09:30:23 +00:00
d67245c50a feat(reporting): Implement 'Report Mistake' feature with API and UI [2f388f42] 2026-01-27 09:12:50 +00:00
ea9bb7c40c feat(reporting): Implement 'Report Mistake' feature with API and UI [2f388f42] 2026-01-27 09:00:20 +00:00
adafab61ae fix(transcription): [2f388f42] finalize and fix AI insights feature
This commit resolves all outstanding issues with the AI Insights feature.

- Corrects the transcript formatting logic in  to properly handle the database JSON structure, ensuring the AI receives the correct context.
- Fixes the Gemini API client by using the correct model name ('gemini-2.0-flash') and the proper client initialization.
- Updates  to securely pass the API key as an environment variable to the container.
- Cleans up the codebase by removing temporary debugging endpoints.
- Adds  script for programmatic updates.
- Updates documentation with troubleshooting insights from the implementation process.
2026-01-26 08:53:13 +00:00
01ae937f6e [2f388f42] Feature: Dynamically load Readme paths from Notion projects, update tasks and Notion_Dashboard. Refactor dev_session.py and remove old strategy documents. 2026-01-25 11:10:51 +00:00
31e1a5fc08 fix(classification): restore service logic and standardization formula
- Restored missing method implementations in ClassificationService (classify, extract_metrics)
- Fixed Standardization Logic not being applied in metric cascade
- Bumped version to v0.7.4 in config.py
- Removed duplicate API endpoint in app.py
- Updated MIGRATION_PLAN.md
2026-01-24 13:34:04 +00:00
01985aca33 fix(UI): Remove duplicate standardized potential block in Inspector 2026-01-24 12:07:04 +00:00
57360496f8 feat(Explorer): Enhance metric extraction, source transparency, and UI display
- **Standardization & Formula Logic:** Fixed NameError/SyntaxError in formula parser; added support for comments and capitalized placeholders.
- **Source URL Tracking:** Extended DB schema and cascade logic to store and track specific source URLs.
- **Frontend & UI:**
  - Added 'Standardized Potential' display in Inspector.
  - Added clickable source link with icon.
  - Fixed Settings tab layout collapse (flex-shrink-0).
- **Export Capabilities:**
  - Single-company JSON export now includes full quantitative metadata.
  - New global CSV export endpoint /api/companies/export.
- **System Integrity:**
  - Fixed Notion sync typo ('Stanardization').
  - Corrected Nginx proxy routing and FastAPI route ordering.
  - Ensured DB persistence via explicit docker-compose volume mapping.
2026-01-24 09:56:59 +00:00
5de7d38fcb docs: added regression tests for metric parser and documented them in GEMINI.md 2026-01-23 21:45:02 +00:00
13119b0e8a fix: removed aggressive string splitting on dashes which truncated sentences and hid metrics (Greilmeier bug) 2026-01-23 21:39:43 +00:00
f3ca139d85 fix: metric parser now aggressively cleans expected_value to handle units (e.g. '8.000 m²') 2026-01-23 21:26:24 +00:00
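A minimal sketch of the kind of cleanup this fix describes, assuming German number formatting (dot as thousands separator, comma as decimal mark). The function name `parse_german_number` is hypothetical and not taken from the repository; a value such as '1.234' is inherently ambiguous and is treated here as one thousand two hundred thirty-four.

```python
import re
from typing import Optional


def parse_german_number(value: str) -> Optional[float]:
    """Parse a German-formatted number and ignore any trailing unit."""
    # Take the first run of digits, dots and commas; units like ' m²' are skipped.
    match = re.search(r"\d[\d.,]*", value)
    if match is None:
        return None
    token = match.group(0).rstrip(".,")
    if "," in token:
        # A comma is the decimal mark, so dots can only be thousands separators.
        token = token.replace(".", "").replace(",", ".")
    elif re.fullmatch(r"\d{1,3}(\.\d{3})+", token):
        # Dots that group digits in threes are treated as thousands separators.
        token = token.replace(".", "")
    try:
        return float(token)
    except ValueError:
        return None


area = parse_german_number("8.000 m²")
```

With this assumption, '8.000 m²' is read as eight thousand rather than eight, which is the case named in the commit message.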
e43e129771 feat: robust metric extraction with confidence score and proof snippets
- fixed Year-Prefix Bug in MetricParser
- added metric_confidence and metric_proof_text to database
- added Entity-Check and Annual-Priority to LLM prompt
- improved UI: added confidence traffic light and mouse-over proof tooltip
- restored missing API endpoints (create, bulk, wiki-override)
2026-01-23 21:16:07 +00:00
c5652fc9b5 feat(app): Add wiki re-evaluation and fix wolfra bug
- Implemented a "Re-evaluate Wikipedia" button in the UI.
- Added a backend endpoint to trigger targeted Wikipedia metric extraction.
- Hardened the LLM metric extraction prompt to prevent hallucinations.
- Corrected several database path errors that caused data loss.
- Updated application version to 0.6.4 and documented the ongoing issue.
2026-01-23 16:05:44 +00:00
a33a60f462 fix(explorer): resolve initialization and import errors for v0.7.0 backend 2026-01-20 17:11:31 +00:00
4ff93cd8e6 feat(explorer): implement v0.7.0 quantitative potential analysis (cascade logic & metric extraction) 2026-01-20 16:38:05 +00:00
ea3d46f380 feat(explorer): add database migration script for industries schema 2026-01-20 15:38:20 +00:00
23b3e709b9 fix(explorer): resolve notion sync, add debug logging, and fix UI display for industries v0.6.1 2026-01-20 15:35:26 +00:00
5801ddc797 fix(ui): Remove unused handlers to fix Typescript build 2026-01-19 11:54:12 +00:00
46b59d4411 fix(ui): Escape '>' in JSX to fix build error 2026-01-19 11:52:38 +00:00
bbefb36f47 feat(ui): Update Industries settings to display Notion-synced fields read-only 2026-01-19 11:51:07 +00:00
7c2ae08c74 fix: Update Notion sync logic to handle existing records and avoid unique constraint errors 2026-01-19 11:37:55 +00:00
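One common way to avoid unique constraint errors when a sync meets existing records is an upsert. The sketch below uses the standard-library `sqlite3` module rather than the project's SQLAlchemy layer, and the table and column names (`industries`, `notion_id`, `name`) are assumptions for illustration only; the commit does not say which approach was actually taken.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE industries (notion_id TEXT PRIMARY KEY, name TEXT NOT NULL)"
)


def sync_industry(conn: sqlite3.Connection, notion_id: str, name: str) -> None:
    """Insert a record, or update it when the unique key already exists."""
    conn.execute(
        "INSERT INTO industries (notion_id, name) VALUES (?, ?) "
        "ON CONFLICT(notion_id) DO UPDATE SET name = excluded.name",
        (notion_id, name),
    )


sync_industry(conn, "abc", "Logistics")
# The second sync of the same key updates the row instead of raising IntegrityError.
sync_industry(conn, "abc", "Logistics & Warehousing")
rows = conn.execute("SELECT notion_id, name FROM industries").fetchall()
```

The `ON CONFLICT ... DO UPDATE` clause requires SQLite 3.24 or newer.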
694be6eb4d fix: Restore previous DB, migrate schema, mount Notion token 2026-01-19 11:34:24 +00:00
35cf0c0753 feat: Implement Notion sync for Industries and Robotics Categories 2026-01-19 11:28:08 +00:00
ea87ace6e2 feat: Connect classification service to DB industries & update docs 2026-01-19 07:58:49 +00:00
4a336f6374 fix(ce): Resolve database schema mismatch and restore docs
- Fixed a critical  in the company-explorer by forcing a database re-initialization with a new file (). This ensures the application code is in sync with the database schema.
- Documented the schema mismatch incident and its resolution in MIGRATION_PLAN.md.
- Restored and enhanced BUILDER_APPS_MIGRATION.md by recovering extensive, valuable content from the git history that was accidentally deleted. The guide once again includes detailed troubleshooting steps and code templates for common migration pitfalls.
2026-01-15 15:54:45 +00:00
4b815c6510 feat(ce): upgrade to v0.5.0 with contacts management, advanced settings and ui modernization 2026-01-15 09:23:58 +00:00
1b5a8e3b96 Docs: Update MIGRATION_PLAN.md to v0.4.0 with new features (Company Explorer) 2026-01-09 10:18:45 +00:00
1f34534474 Update: GTM Architect v2.6.2 (Edit Specs, Report Fix) & Company Explorer v0.4 (Export, Timestamps) 2026-01-09 09:15:16 +00:00
fc119f74d8 fix(company-explorer): handle inconsistent LLM list responses in scraper
- Added logic to automatically flatten list-wrapped JSON responses from LLM in Impressum extraction.
- Fixed 'Unknown Legal Name' issue by ensuring property access on objects, not lists.
- Finalized v0.3.0 features and updated documentation with Lessons Learned.
2026-01-08 16:14:01 +01:00
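The list-flattening fix above can be sketched roughly as follows. The helper name `unwrap_llm_object` is hypothetical, and taking the first list element is an assumption about what "flatten" means here.

```python
from typing import Any


def unwrap_llm_object(parsed: Any) -> dict:
    """Return the dict from a parsed LLM reply that may be wrapped in a list."""
    # Some replies arrive as [{"legal_name": ...}] instead of {"legal_name": ...}.
    while isinstance(parsed, list):
        if not parsed:
            return {}
        parsed = parsed[0]
    # Anything that is still not a dict is treated as "no data".
    return parsed if isinstance(parsed, dict) else {}


wrapped = [{"legal_name": "Example GmbH"}]
legal_name = unwrap_llm_object(wrapped).get("legal_name", "Unknown Legal Name")
```

Because property access always happens on a dict, a list-wrapped reply no longer falls through to the 'Unknown Legal Name' default.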
94bac7c0ca fix(company-explorer): enhance impressum scraping debug logging
- Increased logging verbosity in  to track raw input to LLM and raw LLM response.
- This helps diagnose why Impressum data extraction might be failing for specific company websites.
2026-01-08 16:14:01 +01:00
b3fa036809 feat(company-explorer): force-refresh analysis and refine extraction logic
- Enforced fresh scrape on 'Analyze' request to bypass stale cache.
- Implemented 2-Hop Impressum scraping strategy (via Kontakt page).
- Refined numeric extraction for German locale (thousands separators).
- Updated documentation with Lessons Learned.
2026-01-08 16:14:01 +01:00
601593c65c feat(company-explorer): bump version to 0.3.0, add VAT ID extraction, and fix deep-link scraping
- Updated version to v0.3.0 (UI & Backend) to clear potential caching confusion.
- Enhanced Impressum scraper to extract VAT ID (Umsatzsteuer-ID).
- Implemented 2-Hop scraping strategy: Looks for 'Kontakt' page if Impressum isn't on the start page.
- Added VAT ID display to the Legal Data block in Inspector.
2026-01-08 16:14:01 +01:00
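A rough sketch of a 2-hop lookup like the one described above: check the start page for an Impressum link, and if none is found, follow a 'Kontakt' link and check there. All names (`find_link`, `find_impressum_url`, the injected `fetch` callable) are hypothetical, and the keyword matching is a simplification of whatever detection the scraper really uses.

```python
from html.parser import HTMLParser
from typing import Callable, Optional
from urllib.parse import urljoin


class _LinkCollector(HTMLParser):
    """Collect (href, link text) pairs for every <a> element."""

    def __init__(self) -> None:
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text)))
            self._href = None


def find_link(html: str, base_url: str, keyword: str) -> Optional[str]:
    """Return the first absolute link whose href or text contains keyword."""
    collector = _LinkCollector()
    collector.feed(html)
    for href, text in collector.links:
        if keyword in href.lower() or keyword in text.lower():
            return urljoin(base_url, href)
    return None


def find_impressum_url(start_url: str, fetch: Callable[[str], str]) -> Optional[str]:
    """Hop 1: start page. Hop 2: the 'Kontakt' page, if hop 1 finds nothing."""
    start_html = fetch(start_url)
    url = find_link(start_html, start_url, "impressum")
    if url:
        return url
    kontakt_url = find_link(start_html, start_url, "kontakt")
    if kontakt_url:
        return find_link(fetch(kontakt_url), kontakt_url, "impressum")
    return None
```

Passing `fetch` in as a parameter keeps the sketch independent of any HTTP library and makes it easy to exercise with canned pages.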
dbc3ce9b34 feat(company-explorer): add impressum scraping, robust json parsing, and enhanced ui polling
- Implemented Impressum scraping with Root-URL fallback and enhanced keyword detection.
- Added 'clean_json_response' helper to strip Markdown from LLM outputs, preventing JSONDecodeErrors.
- Improved numeric extraction for German formatting (thousands separators vs decimals).
- Updated Inspector UI with Polling logic for auto-refresh and display of AI Dossier and Legal Data.
- Added Manual Override for Website URL.
2026-01-08 16:14:01 +01:00
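The commit names a 'clean_json_response' helper but does not show it. The following is only a guess at its shape, assuming the Markdown to strip is a code fence wrapped around the whole reply:

```python
import json
import re


def clean_json_response(raw: str) -> str:
    """Strip a surrounding Markdown code fence from an LLM reply, if present."""
    text = raw.strip()
    # Matches a reply wrapped as ```json ... ``` or ``` ... ```.
    match = re.match(r"^```[a-zA-Z0-9_-]*\s*\n?(.*?)\n?```$", text, re.DOTALL)
    if match:
        text = match.group(1).strip()
    return text


reply = '```json\n{"legal_name": "Example GmbH"}\n```'
data = json.loads(clean_json_response(reply))
```

Calling `json.loads` on the raw fenced reply would raise `json.JSONDecodeError`; stripping the fence first avoids that, while unfenced replies pass through unchanged.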
a43b01bb6e feat(company-explorer): add wikipedia integration, robotics settings, and manual overrides
- Ported robust Wikipedia extraction logic (categories, first paragraph) from legacy system.
- Implemented database-driven Robotics Category configuration with frontend settings UI.
- Updated Robotics Potential analysis to use Chain-of-Thought infrastructure reasoning.
- Added Manual Override features for Wikipedia URL (with locking) and Website URL (with re-scrape trigger).
- Enhanced Inspector UI with Wikipedia profile, category tags, and action buttons.
2026-01-08 16:14:01 +01:00
95634d7bb6 feat(company-explorer): Initial Web UI & Backend with Enrichment Flow
This commit introduces the foundational elements for the new "Company Explorer" web application, marking a significant step away from the legacy Google Sheets / CLI system.

Key changes include:
- Project Structure: A new  directory with separate  (FastAPI) and  (React/Vite) components.
- Data Persistence: Migration from Google Sheets to a local SQLite database () using SQLAlchemy.
- Core Utilities: Extraction and cleanup of essential helper functions (LLM wrappers, text utilities) into .
- Backend Services: , ,  for AI-powered analysis, and  logic.
- Frontend UI: Basic React application with company table, import wizard, and dynamic inspector sidebar.
- Docker Integration: Updated  and  for multi-stage builds and sideloading.
- Deployment & Access: Integrated into central Nginx proxy and dashboard, accessible via .

Lessons Learned & Fixed during development:
- Frontend Asset Loading: Addressed issues with Vite's  path and FastAPI's .
- TypeScript Configuration: Added  and .
- Database Schema Evolution: Solved  errors by forcing a new database file and correcting  override.
- Logging: Implemented robust file-based logging ().

This new foundation provides a powerful and maintainable platform for future B2B robotics lead generation.
2026-01-07 17:55:08 +00:00