Brancheneinstufung2

Author	SHA1	Message	Date
Floke	7c2ae08c74	fix: Update Notion sync logic to handle existing records and avoid unique constraint errors	2026-01-19 11:37:55 +00:00
Floke	694be6eb4d	fix: Restore previous DB, migrate schema, mount Notion token	2026-01-19 11:34:24 +00:00
Floke	35cf0c0753	feat: Implement Notion sync for Industries and Robotics Categories	2026-01-19 11:28:08 +00:00
Floke	ea87ace6e2	feat: Connect classification service to DB industries & update docs	2026-01-19 07:58:49 +00:00
Floke	4a336f6374	fix(ce): Resolve database schema mismatch and restore docs - Fixed a critical in the company-explorer by forcing a database re-initialization with a new file (). This ensures the application code is in sync with the database schema. - Documented the schema mismatch incident and its resolution in MIGRATION_PLAN.md. - Restored and enhanced BUILDER_APPS_MIGRATION.md by recovering extensive, valuable content from the git history that was accidentally deleted. The guide now again includes detailed troubleshooting steps and code templates for common migration pitfalls.	2026-01-15 15:54:45 +00:00
Floke	4b815c6510	feat(ce): upgrade to v0.5.0 with contacts management, advanced settings and ui modernization	2026-01-15 09:23:58 +00:00
Floke	1b5a8e3b96	Docs: Update MIGRATION_PLAN.md to v0.4.0 with new features (Company Explorer)	2026-01-09 10:18:45 +00:00
Floke	fc119f74d8	fix(company-explorer): handle inconsistent LLM list responses in scraper - Added logic to automatically flatten list-wrapped JSON responses from LLM in Impressum extraction. - Fixed 'Unknown Legal Name' issue by ensuring property access on objects, not lists. - Finalized v0.3.0 features and updated documentation with Lessons Learned.	2026-01-08 16:14:01 +01:00
Floke	94bac7c0ca	fix(company-explorer): enhance impressum scraping debug logging - Increased logging verbosity in to track raw input to LLM and raw LLM response. - This helps diagnose why Impressum data extraction might be failing for specific company websites.	2026-01-08 16:14:01 +01:00
Floke	b3fa036809	feat(company-explorer): force-refresh analysis and refine extraction logic - Enforced fresh scrape on 'Analyze' request to bypass stale cache. - Implemented 2-Hop Impressum scraping strategy (via Kontakt page). - Refined numeric extraction for German locale (thousands separators). - Updated documentation with Lessons Learned.	2026-01-08 16:14:01 +01:00
Floke	601593c65c	feat(company-explorer): bump version to 0.3.0, add VAT ID extraction, and fix deep-link scraping - Updated version to v0.3.0 (UI & Backend) to clear potential caching confusion. - Enhanced Impressum scraper to extract VAT ID (Umsatzsteuer-ID). - Implemented 2-Hop scraping strategy: Looks for 'Kontakt' page if Impressum isn't on the start page. - Added VAT ID display to the Legal Data block in Inspector.	2026-01-08 16:14:01 +01:00
Floke	dbc3ce9b34	feat(company-explorer): add impressum scraping, robust json parsing, and enhanced ui polling - Implemented Impressum scraping with Root-URL fallback and enhanced keyword detection. - Added 'clean_json_response' helper to strip Markdown from LLM outputs, preventing JSONDecodeErrors. - Improved numeric extraction for German formatting (thousands separators vs decimals). - Updated Inspector UI with Polling logic for auto-refresh and display of AI Dossier and Legal Data. - Added Manual Override for Website URL.	2026-01-08 16:14:01 +01:00
Floke	a43b01bb6e	feat(company-explorer): add wikipedia integration, robotics settings, and manual overrides - Ported robust Wikipedia extraction logic (categories, first paragraph) from legacy system. - Implemented database-driven Robotics Category configuration with frontend settings UI. - Updated Robotics Potential analysis to use Chain-of-Thought infrastructure reasoning. - Added Manual Override features for Wikipedia URL (with locking) and Website URL (with re-scrape trigger). - Enhanced Inspector UI with Wikipedia profile, category tags, and action buttons.	2026-01-08 16:14:01 +01:00
Floke	95634d7bb6	feat(company-explorer): Initial Web UI & Backend with Enrichment Flow This commit introduces the foundational elements for the new "Company Explorer" web application, marking a significant step away from the legacy Google Sheets / CLI system. Key changes include: - Project Structure: A new directory with separate (FastAPI) and (React/Vite) components. - Data Persistence: Migration from Google Sheets to a local SQLite database () using SQLAlchemy. - Core Utilities: Extraction and cleanup of essential helper functions (LLM wrappers, text utilities) into . - Backend Services: , , for AI-powered analysis, and logic. - Frontend UI: Basic React application with company table, import wizard, and dynamic inspector sidebar. - Docker Integration: Updated and for multi-stage builds and sideloading. - Deployment & Access: Integrated into central Nginx proxy and dashboard, accessible via . Lessons Learned & Fixed during development: - Frontend Asset Loading: Addressed issues with Vite's path and FastAPI's . - TypeScript Configuration: Added and . - Database Schema Evolution: Solved errors by forcing a new database file and correcting override. - Logging: Implemented robust file-based logging (). This new foundation provides a powerful and maintainable platform for future B2B robotics lead generation.	2026-01-07 17:55:08 +00:00

14 Commits