ebea70573b
- Repository structure for all services, infra, lakehouse, dashboards - K8s manifests targeting stonks-oracle namespace with GHCR images - Ingress via Traefik with ca-issuer TLS for internal services - ConfigMap wired to existing cluster services (pg, redis, minio, ollama) - GitHub Actions workflow for lint, test, multi-service container builds - Dockerfile with build-arg CMD per service - Makefile for local build/push/deploy - Steering rules for TDD workflow, K8s conventions, project context - Agent hooks for lint-on-save, test-on-save, k8s-validate, phase-commit - Ruff linter config, all lint issues fixed - 14 passing tests for schemas, config, redis keys - PostgreSQL migrations, Trino catalogs, Superset config, MinIO lifecycle
21 lines
665 B
SQL
21 lines
665 B
SQL
-- Analytical fact table: documents
|
|
-- Partitioned by dt and source_type on MinIO
|
|
-- Path: s3://stonks-lakehouse/warehouse/documents/dt={yyyy-mm-dd}/source_type={type}/part-*.parquet
|
|
|
|
CREATE TABLE IF NOT EXISTS lakehouse.stonks.documents (
|
|
document_id VARCHAR,
|
|
document_type VARCHAR,
|
|
source_type VARCHAR,
|
|
ticker VARCHAR,
|
|
publisher VARCHAR,
|
|
title VARCHAR,
|
|
published_at TIMESTAMP(6) WITH TIME ZONE,
|
|
content_hash VARCHAR,
|
|
confidence DOUBLE,
|
|
dt DATE
|
|
) WITH (
|
|
format = 'PARQUET',
|
|
partitioned_by = ARRAY['dt'],
|
|
external_location = 's3a://stonks-lakehouse/warehouse/documents/'
|
|
);
|