phase 0+1: project scaffold, k8s manifests, CI pipeline, steering, hooks, tests
- Repository structure for all services, infra, lakehouse, dashboards - K8s manifests targeting stonks-oracle namespace with GHCR images - Ingress via Traefik with ca-issuer TLS for internal services - ConfigMap wired to existing cluster services (pg, redis, minio, ollama) - GitHub Actions workflow for lint, test, multi-service container builds - Dockerfile with build-arg CMD per service - Makefile for local build/push/deploy - Steering rules for TDD workflow, K8s conventions, project context - Agent hooks for lint-on-save, test-on-save, k8s-validate, phase-commit - Ruff linter config, all lint issues fixed - 14 passing tests for schemas, config, redis keys - PostgreSQL migrations, Trino catalogs, Superset config, MinIO lifecycle
This commit is contained in:
@@ -0,0 +1,21 @@
|
||||
-- Analytical fact table: document_extractions
|
||||
-- Partitioned by dt and model_version on MinIO
|
||||
|
||||
CREATE TABLE IF NOT EXISTS lakehouse.stonks.document_extractions (
|
||||
document_id VARCHAR,
|
||||
ticker VARCHAR,
|
||||
sentiment VARCHAR,
|
||||
impact_score DOUBLE,
|
||||
catalyst_type VARCHAR,
|
||||
confidence DOUBLE,
|
||||
novelty_score DOUBLE,
|
||||
model_name VARCHAR,
|
||||
prompt_version VARCHAR,
|
||||
extraction_at TIMESTAMP(6) WITH TIME ZONE,
|
||||
dt DATE,
|
||||
model_version VARCHAR
|
||||
) WITH (
|
||||
format = 'PARQUET',
|
||||
partitioned_by = ARRAY['dt', 'model_version'],
|
||||
external_location = 's3a://stonks-lakehouse/warehouse/document_extractions/'
|
||||
);
|
||||
Reference in New Issue
Block a user