BanGUI

Author	SHA1	Message	Date
Lukas	0a3f9c6c16	refactor(backend): external logging metrics, required mode, health checks - Add external_logging_init_failures counter - Add external_log_required flag, raise if init fails and required - Health endpoint: add external_logging status check - Blocklist service: enrich with metadata fields, update import logic - Health check task: add runtime_state dependency, fix return typing - Metrics: add Histogram for request latencies - Frontend: align BlocklistImportLogSection props - Docs: update deployment guide, remove stale tasks Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-04 03:45:13 +02:00
Lukas	cee3daffc1	fix: enforce PRAGMA query_only on fail2ban DB and refactor CSRF cookie name - Add _acquire_readonly_connection() that applies PRAGMA query_only=ON after connect - Verify PRAGMA value back to catch URI flag bypasses - Wrap in async context manager _readonly_connection() used by all repo methods - Replace hardcoded '_SESSION_COOKIE_NAME' in CSRF middleware with import from app.utils.constants - Remove completed Issues #45 and #46 from Docs/Tasks.md (Issue #46 now fixed, #45 cache invalidation deferred to auth refactor branch) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-03 21:47:42 +02:00
Lukas	e436727942	fix: atomic upsert for import runs (Issue #12) Replace check-then-insert race condition with INSERT ON CONFLICT. - upsert_pending uses RETURNING id for atomic upsert - UNIQUE(source_id, content_hash) constraint from migration 6 - blocklist_import_workflow updated to use upsert_pending - test_import_source_success fixed for async mock patterns Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-02 23:39:43 +02:00
Lukas	1285bc8571	feat: comprehensive health check with DB, scheduler, cache - Add /api/v1/health endpoint with component-level checks - Verify DB connectivity, fail2ban socket, scheduler, session cache - Add SQLite WAL cleanup on startup (orphan crash files) - Migration 8: import_log.timestamp → INTEGER UNIX epoch - Align import_log timestamps with history_archive (already UNIX int) - Add unit tests for DB cleanup and health router Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-02 23:03:57 +02:00
Lukas	cc6dbcf3f0	feat: implement API versioning /api/v1/ - All backend routers moved to /api/v1/ prefix - Frontend BASE_URL updated to /api/v1 - Setup redirect middleware updated to redirect to /api/v1/setup - Health router path fixed: prefix=/api/v1/health, @router.get('') - conftest.py: set server_status=online for test fixture - Created Docs/API_VERSIONING.md with deprecation policy - Updated Docs/Backend-Development.md with versioning section - Updated Instructions.md curl examples Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-02 21:29:30 +02:00
Lukas	0d5882b32f	Fix HIGH priority issues: unbounded queries, rate limiting, health checks Issue #3 - Unbounded Query Results (OOM): - get_all_archived_history() now uses keyset pagination with bounded max_rows (50k default) - Added 'id' field to records from get_archived_history() and get_archived_history_keyset() - Protocol signature updated with page_size, max_rows, last_ban_id params Issue #7 - Docker Health Check Fails: - Added curl to Dockerfile.backend runtime image - HEALTHCHECK now uses 'curl -f http://localhost:8000/api/health' - compose.prod.yml: increased start_period to 40s, timeout to 10s - Frontend healthcheck proxies to backend /api/health Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-01 21:47:36 +02:00
Lukas	67b26a3ef7	Refactor pagination with cursor-based support and standardized response format - Implement cursor-based pagination in pagination.py - Update response models to standardize pagination structure - Add cursor pagination utilities for repositories - Update HistoryArchiveRepository and ImportLogRepository with new pagination - Add comprehensive tests for cursor pagination - Update documentation for backend development and task tracking Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-01 17:54:05 +02:00
Lukas	f9e283541b	Add explicit database transaction isolation to multi-step operations This commit addresses race conditions in multi-step database operations by: 1. Wrap write operations in BEGIN IMMEDIATE ... COMMIT transactions: - import_run_repo: create_pending, mark_completed, mark_failed - geo_cache_repo: all upsert_*_and_commit functions - geo_cache_repo: bulk_upsert_entries_and_neg_entries_and_commit 2. Handle concurrent write collisions gracefully: - import_run_repo.create_pending can now raise IntegrityError - blocklist_import_workflow catches IntegrityError and retries lookup - Logs 'blocklist_import_lost_race' event when another request wins the race 3. Add comprehensive documentation: - Backend-Development.md § 6.3 Database Transactions - Explains when to use BEGIN IMMEDIATE - Shows transaction pattern with try-except-rollback - Documents race condition error handling pattern The solution leverages SQLite's UNIQUE constraint for data integrity while handling the concurrent case gracefully in application logic. This is more efficient than using BEGIN EXCLUSIVE which would serialize all writers. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-30 22:04:15 +02:00
Lukas	52f237d5d4	Make background tasks idempotent - prevent duplicate bans on retry CRITICAL FIX: Background tasks (especially blocklist_import) crashed mid-execution, leaving partial state. On retry, the same bans were applied again, causing duplicates. Solution: Content-hash based operation tracking for blocklist imports: - Added import_runs table (migration 6) to track operations by source + content hash - Before banning, check if this exact content has already been imported - If completed: skip banning (already done), optionally re-warm cache - If new or failed: proceed with ban and mark as completed or failed Changes: - Database: Migration 6 adds import_runs table with operation state tracking - Model: Added ImportRunEntry for import run records - Repository: New import_run_repo module with CRUD operations - Workflow: Updated blocklist_import_workflow to check operation history before banning - Dependencies: Registered import_run_repo for dependency injection - Tests: Added test_import_source_idempotent_on_retry and test_import_source_different_content_not_reused - Documentation: Added Task Idempotency section to Backend-Development.md Verification: - All 7 import tests pass (5 existing + 2 new idempotency tests) - Type checking: mypy --strict ✅ - Linting: ruff ✅ - No API changes, backwards compatible via automatic migration Fixes: Background tasks not idempotent #CRITICAL Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-30 21:54:14 +02:00
Lukas	1302ac821f	Fix non-atomic setup persistence across DB contexts (Issue #30) Implement transactional setup with explicit state machine and crash-safety to prevent partial commits from leaving inconsistent state. ## Changes ### Core Implementation 1. settings_repo.py: Add atomic batch settings write - New set_settings_batch() method: writes multiple settings in single transaction (BEGIN IMMEDIATE ... COMMIT). Either all settings persist or none do, preventing partial state if crash occurs mid-batch. 2. setup_service.py: Refactor run_setup() with transactional phases - Phase 0: Compute password hash early (before any DB writes) to ensure idempotency. Same hash is used throughout retries, preventing divergent hashes from bcrypt's random salt. - Phase 1 (Bootstrap DB transaction): Set setup_state=in_progress and database_path, then commit. First checkpoint for crash detection. - Phase 2 (Filesystem): Initialize runtime database (idempotent) - Phase 3 (Runtime DB transaction): Batch-write all settings atomically - Phase 4 (Bootstrap DB transaction): Set setup_state=complete and setup_completed=1. Final commit point. 3. protocols.py: Add set_settings_batch to SettingsRepository protocol ### Testing - Added 6 new transactionality tests covering: - State machine transitions (None → in_progress → complete) - Password hash idempotency across retries - Atomic batch writes (all-or-nothing persistence) - Bootstrap DB state tracking - Database path propagation to both DBs - Recovery on partial failure - All 18 tests pass (12 existing + 6 new) ### Documentation - Updated Docs/Architekture.md with new section 6: - Setup state machine with state transitions - Transaction boundary documentation - Password hash idempotency rationale - Backward compatibility notes ## Design Decisions ### Why This Approach - Current code already idempotent via INSERT OR REPLACE, but password hash non-idempotency created silent inconsistency risk - Simpler than multi-state machine: 2 states sufficient for detection - Maintains backward compatibility (setup_completed key still written) - Explicit transactions make crash-safety obvious to future maintainers ### Crash Scenarios Now Handled 1. Crash after Phase 1 → detected by setup_state=in_progress on retry 2. Crash after Phase 2 → runtime DB may be partial, safe to retry 3. Crash after Phase 3 → runtime DB rolls back on next connection 4. Crash after Phase 4 → setup_completed detected, skipped Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-29 19:19:53 +02:00
Lukas	a273b96563	feat: Complete repository protocol coverage - Add missing protocol methods to Fail2BanDbRepository: - get_ban_event_counts: Aggregate ban events per IP (used in ban_service) - Add missing protocol methods to GeoCacheRepository: - delete_stale_entries: Remove old geo cache entries (used in geo_cache_cleanup) - Add missing protocol methods to HistoryArchiveRepository: - purge_archived_history: Remove archived entries older than age threshold - Add comprehensive protocol compliance tests: - Created test_protocol_compliance.py with 8 test classes - Validates all 7 repository modules fully implement their protocols - Prevents silent protocol drift when methods change signatures - Tests verify no unexpected public methods in repository modules - Update documentation: - Add Repository Protocol Coverage Checklist to Backend-Development.md - Document procedure for adding new repositories with protocol definitions - List current protocol coverage (all 7 repositories, 40 total methods) - All repositories now have 100% protocol coverage: - SessionRepository: 4 methods - SettingsRepository: 4 methods - BlocklistRepository: 6 methods - ImportLogRepository: 4 methods - GeoCacheRepository: 13 methods - HistoryArchiveRepository: 5 methods - Fail2BanDbRepository: 8 methods This ensures: - Enhanced mockability for testing - Static contract verification - Prevention of protocol drift - Better IDE support and type checking Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-28 07:58:57 +02:00
Lukas	52a4d04d92	Task 8: Standardize modeling style (TypedDict vs Pydantic) Convert inconsistent modeling style to standardized Pydantic models for all external-facing data structures while maintaining TypedDict compatibility where appropriate for internal layer-private structures. Changes: - Converted IpLookupResult TypedDict to use IpLookupResponse Pydantic model in jail_service.lookup_ip() for consistency with routers - Added GeoCacheEntry Pydantic model for geo cache repository rows - Converted GeoCacheRow TypedDict to use GeoCacheEntry alias - Converted ImportLogRow TypedDict to use ImportLogEntry alias - Updated routers and services to work with Pydantic models - Updated all tests to use Pydantic model field access (attributes) instead of dict subscripting Documentation: - Added 'Model Type Usage by Layer' section to Backend-Development.md - Defines when TypedDict is allowed (internal structures) vs Pydantic (external-facing, cross-boundary data) - Provides clear guidance on modeling conventions per layer Benefits: - Consistent validation and serialization behavior - Better IDE support and type checking - Clearer separation of concerns by layer - Reduced maintenance cost from mixed validation approaches Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-28 07:53:30 +02:00
Lukas	3888c5eb3f	Refactor ban management with domain models and mappers - Add ban domain model for core business logic separation - Implement mapper pattern for DTO/domain conversions - Update ban service with new domain-driven approach - Refactor router endpoints to use new architecture - Add comprehensive mapper tests - Update documentation with architecture changes Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-28 07:46:02 +02:00
Lukas	e2560f5db0	TASK-032: Implement geo_cache retention policy and cleanup Add automatic cleanup of stale geolocation cache entries to prevent unbounded database growth. Resolves the issue where unique IP addresses accumulated indefinitely in the geo_cache table, degrading query performance. ## Changes ### Database Schema (Migration 3) - Add 'last_seen' column to geo_cache table tracking last reference time - Existing entries default to current timestamp ### Repository Layer (geo_cache_repo.py) - Update upsert_entry() to set/refresh last_seen on insert/update - Update upsert_neg_entry() to set/refresh last_seen on negative cache hits - Update bulk_upsert_entries() to set/refresh last_seen in batch operations - Add delete_stale_entries(db, cutoff_iso) -> int for purging old entries ### Background Task (geo_cache_cleanup.py) - New APScheduler task that runs nightly (24-hour interval) - Calculates cutoff as 90 days ago from current time (UTC) - Deletes all entries with last_seen older than cutoff - Logs operation results (info when deleted > 0, debug when 0 deleted) - Configurable retention period via GEO_CACHE_RETENTION_DAYS constant ### Application Startup (startup.py) - Register geo_cache_cleanup task in scheduler during app startup - Placed after geo_cache_flush in task registration order ### Tests - Add delete_stale_entries test cases covering: * Removal of old entries beyond cutoff * No deletion when all entries are recent * Empty table edge case - Update existing test fixtures to include last_seen column - Add full test suite for cleanup task registration and execution ### Documentation - Architekture.md: Document cleanup task, update schema/diagram - Backend-Development.md: Add retention policy documentation ## Behavior When an IP is accessed, its last_seen is refreshed. After 90 days of no access, an IP is purged by the nightly cleanup. On next encounter, the IP is re-resolved from MaxMind MMDB or ip-api.com (if configured). This is acceptable because: 1. Stale geolocation data may become inaccurate over time 2. Re-resolution cost is minimal compared to unbounded storage growth 3. Active IPs maintain fresh data through their last_seen updates Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-26 19:24:34 +02:00
Lukas	81f009e323	TASK-022: Hash session tokens in database for security - Store session tokens as one-way SHA256 hashes instead of plaintext - Hash tokens on write (create_session) and on read (get_session, delete_session) - Add migration to drop plaintext sessions table and recreate with token_hash column - Update Session model: token field still contains raw token for signing - Add test to verify tokens are hashed in database, not plaintext - Update Architekture.md to document session token hashing - Update Backend-Development.md with implementation pattern and best practices Prevents direct session token hijacking if database file is exposed to attacker. If plaintext DB was readable, sessions are invalidated by the migration anyway. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-26 14:36:21 +02:00
Lukas	b6e8e3f5ff	Clean up unused imports and remove completed task - Remove TASK-016 from Docs/Tasks.md (completed) - Remove unused imports from protocols.py (Iterable, BanIpCount) - Remove unused imports from raw_config_io_service.py (asyncio, ConfigDirError, ConfigFileExistsError, ConfFileEntry) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-26 14:08:43 +02:00
Lukas	667ab674ca	Fix SQLite LIKE wildcard escaping in IP filter queries - Add escape_like() helper to escape % and _ wildcards in LIKE queries - Update fail2ban_db_repo.get_history_page() to use escaping - Update history_archive_repo.get_archived_history() to use escaping - Add ESCAPE clause to all LIKE queries - Add comprehensive unit tests for escape_like function - Add integration tests for LIKE wildcard handling - Document LIKE escaping best practices in Backend-Development.md Fixes TASK-017: Prevent unintended LIKE matches when IP filter contains special characters like underscore or percent sign. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-26 14:07:49 +02:00
Lukas	b44b72053a	T-11: Validate repository Protocol structural compatibility — minimal approach (Option B) Problem: Repository modules use structural typing to satisfy Protocol interfaces via cast(). A function rename, parameter change, or signature mismatch would silently pass mypy but fail at runtime. Solution (Option B — minimal): 1. Aligned Protocol signatures in protocols.py with actual implementations: - BlocklistRepository: dict[str, object] → dict[str, Any] (matches implementation) - ImportLogRepository: dict[str, object] → ImportLogRow (typed model) - GeoCacheRepository: dict[str, object] → GeoCacheRow; Iterable → Sequence - HistoryArchiveRepository: dict[str, object] → dict[str, Any] - ImportLogRepository: async compute_total_pages → sync (matches implementation) 2. Created CI validation script (backend/scripts/validate_repository_protocols.py) that runs at build time to ensure all repository modules satisfy their Protocol interfaces. Exit 0 if valid, 1 if any mismatch. Detects: - Missing functions - Parameter count mismatches - Type annotation mismatches - Return type mismatches 3. Updated backend/app/dependencies.py with explicit docstrings linking each get_*_repo() provider to Backend-Development.md § 13.7.1, explaining the module-as-Protocol pattern and that it is intentional and validated. 4. Documented the pattern in Backend-Development.md § 13.7.1: 'Repository Module Pattern — Module-as-Protocol Structural Compatibility' explaining why the pattern works, risks (silent breakage), and how the validation mitigates it. 5. Fixed type annotation in history_archive_repo.py: - get_all_archived_history returns list[dict] → list[dict[str, Any]] - Imported Any type Benefits: - Prevents silent breakage of repository interfaces - Formalizes the module-as-Protocol pattern as intentional - CI validation prevents regressions without refactoring cost - All repository tests pass (53/53) - mypy --strict passes on modified files Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-25 18:59:49 +02:00
Lukas	c1f188643c	Move geo cache commit handling into repository layer	2026-04-18 20:10:05 +02:00
Lukas	db5b4cb77e	Add settings and history archive repository protocols and DI support	2026-04-17 20:54:08 +02:00
Lukas	56f03f39c7	Move history archive max timestamp query into repository	2026-04-15 21:18:44 +02:00
Lukas	09c764cebc	Task 25: extend service/repository protocol coverage and wire DI aliases	2026-04-14 12:32:42 +02:00
Lukas	3371ff8324	Introduce service/repository dependency protocols and tests	2026-04-10 19:51:19 +02:00
Lukas	c51858ec71	Add country-specific companion table filtering for map page	2026-04-05 22:12:06 +02:00
Lukas	9f05da2d4d	Complete history archive support for dashboard/map data and mark task finished Add source=archive option for dashboard endpoints and history service; update Docs/Tasks.md; include archive branch for list_bans, bans_by_country, ban_trend, bans_by_jail; tests for archive paths.	2026-03-28 12:39:47 +01:00
Lukas	0d4a2a3311	history archive purge uses current age and test uses dynamic timestamps	2026-03-24 20:52:40 +01:00
Lukas	cf721513e8	Fix history origin filter path and add regression tests	2026-03-22 20:32:40 +01:00
Lukas	1c0bac1353	refactor: improve backend type safety and import organization - Add TYPE_CHECKING guards for runtime-expensive imports (aiohttp, aiosqlite) - Reorganize imports to follow PEP 8 conventions - Convert TypeAlias to modern PEP 695 type syntax (where appropriate) - Use Sequence/Mapping from collections.abc for type hints (covariant) - Replace string literals with cast() for improved type inference - Fix casting of Fail2BanResponse and TypedDict patterns - Add IpLookupResult TypedDict for precise return type annotation - Reformat overlong lines for readability (120 char limit) - Add asyncio_mode and filterwarnings to pytest config - Update test fixtures with improved type hints This improves mypy type checking and makes type relationships explicit.	2026-03-22 14:24:24 +01:00
Lukas	bdcdd5d672	Fix geo_re_resolve async mocks and mark tasks complete	2026-03-22 14:24:24 +01:00
Lukas	c9e688cc52	Refactor geo cache persistence into repository + remove raw SQL from tasks/main, update task list	2026-03-22 14:24:24 +01:00
Lukas	1ce5da9e23	Refactor blocklist log retrieval via service layer and add fail2ban DB repo	2026-03-22 14:24:24 +01:00
Lukas	93f0feabde	Refactor geo re-resolve to use geo_cache repo and move data-access out of router	2026-03-22 14:24:24 +01:00
Lukas	1efa0e973b	Stage 10: external blocklist importer — backend + frontend - blocklist_repo.py: CRUD for blocklist_sources table - import_log_repo.py: add/list/get-last log entries - blocklist_service.py: source CRUD, preview, import (download/validate/ban), import_all, schedule get/set/info - blocklist_import.py: APScheduler task (hourly/daily/weekly schedule triggers) - blocklist.py router: 9 endpoints (list/create/update/delete/preview/import/ schedule-get+put/log) - blocklist.py models: ScheduleFrequency (StrEnum), ScheduleConfig, ScheduleInfo, ImportSourceResult, ImportRunResult, PreviewResponse - 59 new tests (18 repo + 19 service + 22 router); 374 total pass - ruff clean, mypy clean for Stage 10 files - types/blocklist.ts, api/blocklist.ts, hooks/useBlocklist.ts - BlocklistsPage.tsx: source management, schedule picker, import log table - Frontend tsc + ESLint clean	2026-03-01 15:33:24 +01:00
Lukas	750785680b	feat: Stage 2 — authentication and setup flow Backend (tasks 2.1–2.6, 2.10): - settings_repo: get/set/delete/get_all CRUD for the key-value settings table - session_repo: create/get/delete/delete_expired for session rows - setup_service: bcrypt password hashing, one-time-only enforcement, run_setup() / is_setup_complete() / get_password_hash() - auth_service: login() with bcrypt verify + token creation, validate_session() with expiry check, logout() - setup router: GET /api/setup (status), POST /api/setup (201 / 409) - auth router: POST /api/auth/login (token + HttpOnly cookie), POST /api/auth/logout (clears cookie, idempotent) - SetupRedirectMiddleware: 307 → /api/setup for all API paths until setup done - require_auth dependency: cookie or Bearer token → Session or 401 - conftest.py: manually bootstraps app.state.db for router tests (ASGITransport does not trigger ASGI lifespan) - 85 tests pass; ruff 0 errors; mypy --strict 0 errors Frontend (tasks 2.7–2.9): - types/auth.ts, types/setup.ts, api/auth.ts, api/setup.ts - AuthProvider: sessionStorage-backed context (isAuthenticated, login, logout) - RequireAuth: guard component → /login?next=<path> when unauthenticated - SetupPage: Fluent UI form, client-side validation, inline errors - LoginPage: single password input, ?next= redirect after success - DashboardPage: placeholder (full impl Stage 5) - App.tsx: full route tree (/setup, /login, /, *)	2026-02-28 21:33:30 +01:00
Lukas	7392c930d6	feat: Stage 1 — backend and frontend scaffolding Backend (tasks 1.1, 1.5–1.8): - pyproject.toml with FastAPI, Pydantic v2, aiosqlite, APScheduler 3.x, structlog, bcrypt; ruff + mypy strict configured - Pydantic Settings (BANGUI_ prefix env vars, fail-fast validation) - SQLite schema: settings, sessions, blocklist_sources, import_log; WAL mode + foreign keys; idempotent init_db() - FastAPI app factory with lifespan (DB, aiohttp session, scheduler), CORS, unhandled-exception handler, GET /api/health - Fail2BanClient: async Unix-socket wrapper using run_in_executor, custom error types, async context manager - Utility modules: ip_utils, time_utils, constants - 47 tests; ruff 0 errors; mypy --strict 0 errors Frontend (tasks 1.2–1.4): - Vite + React 18 + TypeScript strict; Fluent UI v9; ESLint + Prettier - Custom brand theme (#0F6CBD, WCAG AA contrast) with light/dark variants - Typed fetch API client (ApiError, get/post/put/del) + endpoints constants - tsc --noEmit 0 errors	2026-02-28 21:15:01 +01:00

35 Commits
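
Commit 667ab674ca describes escaping `%` and `_` in IP filter values and adding an ESCAPE clause to the LIKE queries. The log does not show the repository's code, so the following is only a minimal sketch of that technique using the standard-library `sqlite3` module. The signature of `escape_like()`, the `bans` table, and the helpers `find_ips()` and `build_demo_db()` are assumptions made for illustration, not BanGUI's actual implementation.

```python
import sqlite3

ESCAPE_CHAR = "\\"  # assumed escape character; the real helper may use another


def escape_like(value: str, escape_char: str = ESCAPE_CHAR) -> str:
    """Escape LIKE wildcards so that they match literally."""
    # The escape character itself must be doubled first, otherwise the
    # escapes added for % and _ below would be escaped a second time.
    return (
        value.replace(escape_char, escape_char * 2)
        .replace("%", escape_char + "%")
        .replace("_", escape_char + "_")
    )


def find_ips(conn: sqlite3.Connection, fragment: str) -> list[str]:
    """Return IPs containing `fragment` literally (hypothetical query)."""
    pattern = f"%{escape_like(fragment)}%"
    rows = conn.execute(
        # The ESCAPE clause tells SQLite which character marks a literal
        # wildcard; the pattern itself is still passed as a bound parameter.
        "SELECT ip FROM bans WHERE ip LIKE ? ESCAPE '\\' ORDER BY ip",
        (pattern,),
    ).fetchall()
    return [row[0] for row in rows]


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory table with values that an unescaped '_' would match."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE bans (ip TEXT NOT NULL)")
    conn.executemany(
        "INSERT INTO bans (ip) VALUES (?)",
        [("10.0.0.1",), ("10_0_0_1",), ("1000001",)],
    )
    return conn
```

Without the escaping, the filter `0_0` would become the pattern `%0_0%` and match all three demo rows, because `_` stands for any single character. With it, only the row that contains a literal underscore is returned.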