hyungi_document_server

Author	SHA1	Message	Date
Hyungi Ahn	e8c348ab21	feat(dashboard): Day 4 튜닝 — 임계치 재조정 + deep_summary 안정성 카드 3일 telemetry (599 triage / 555 deep) 기반 임계치 재평가: 1. 에스컬레이션 비율 — 임계치 의미 reframe - 기존: >20% 적색 (튜닝 필요) → 항상 적색 (운영 패턴 97%) - 신규: <80% 적색 (정책 매칭 실패 증가) - 메시지: "safety 정책상 95~100% 가 정상" 보조 표시 - safety_reference 99.7%, generic 100% (fallback risk_flag), msds 46.2% → 운영 정상 패턴 확인 2. Deep summary 안정성 — 신규 카드 추가 - mode='summary_deep' 의 error_code IS NOT NULL 비율 - 현재 5.2% (call_failed 21 + parse:ValidationError 8) - >5% 적색 임계 - MLX 호출 timeout / JSON 파싱 실패 모니터 3. triage JSON 건강도, Backlog Suppression — 임계치 유지 - 현재 0%, 1% — 매우 안정. 보수적 임계 유효. Backend: TierHealthStack 에 deep_total / deep_err_total 추가 Frontend: 카드 그리드 3열 → 4열 (lg), Day 4 신규 카드. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 08:29:53 +09:00
Hyungi Ahn	95bcdb851b	fix(ops): backfill 쿼리에 빈 extracted_text 제외 — 무한 retry 방지 3일 운영 결과 doc 4811, 5181 가 extracted_text='' (빈 문자열) 인데 IS NOT NULL 만 걸려 enqueue → classify_worker 의 not doc.extracted_text truthy 체크에서 ValueError → max_attempts(3) 도달 → status=failed. 다음 backfill 사이클에서 다시 enqueue 되어 12회 반복, failed 24건 누적. 수정: tier_backfill.py + backfill_tier.py 양쪽 SQL 에 LENGTH(extracted_text) > 0 추가. 빈 문자열 문서는 enqueue 자체에서 제외. 기존 failed 24건 정리 SQL (사용자가 수동 실행): DELETE FROM processing_queue WHERE stage='classify' AND status='failed' AND error_message LIKE '%extracted_text%'; Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 08:25:12 +09:00
Hyungi Ahn	10364fbe1b	fix(study): refresh assets attr on create to avoid async lazy load	2026-04-27 08:20:29 +09:00
Hyungi Ahn	2df7b24ac9	fix(study): split migration 164 into 10 single-statement files (asyncpg) asyncpg prepared statement 는 single-command 만 허용. 원래 한 파일이던 study_sessions 스키마(CREATE TABLE x2 + CREATE INDEX x8)를 143~146 분할 패턴 따라 10개로 분리. 164: CREATE TABLE study_sessions 165~169: study_sessions 인덱스 5개 (partial) 170: CREATE TABLE study_session_assets 171~173: study_session_assets 인덱스 3개 문제: cannot insert multiple commands into a prepared statement 원인: _run_migrations 가 conn.exec_driver_sql 로 단일 prepared statement 실행	2026-04-27 08:18:40 +09:00
Hyungi Ahn	7804f22dce	feat(study): study_sessions backend (Phase 1) — 자격증/어학 일반 학습 세션 + assets 연결 iPad 손글씨 필사 / 모바일 암기노트 / 모바일 퀴즈가 같은 데이터를 공유하는 일반 학습 세션 backend. study_type 으로 certification/language 분기. - migrations/164: study_sessions + study_session_assets DDL + 5 partial indexes - app/models/study_session.py: StudySession + StudySessionAsset ORM (cascade) - app/api/study_sessions.py: CRUD + snapshot(PNG) + assets + filter + groups - ownership: 모든 endpoint user_id 검증, mismatch 도 404 (정보 누설 방지) - 409 중복: UNIQUE(session, document, asset_type, role) 사전 SELECT + IntegrityError 폴백 - enum 422: study_type / mode / asset_type / role / review_state / order - filter: 11개 (study_type, certification, language_code, learning_level, subject, topic, review_state, document_id, asset_type, mode, due_before) - groups: certification 트리 + language 트리 + has_audio/has_video - snapshot: documents.py atomic rename + error_code 패턴 차용 - app/main.py: /api/study-sessions router 등록 plan: ~/.claude/plans/scalable-chasing-stonebraker.md Phase 1 미사용 필드 (review_state/quiz/ocr/ai_summary/prompt) 는 NULL 허용, 자동 로직은 Phase 2~4 별도 PR 에서 활성.	2026-04-27 08:15:28 +09:00
Hyungi Ahn	c6335c9a1e	fix(classify): law_monitor skip 분기 복원 + tier_backfill law 제외 PR-B refactor 과정에서 `e88640d` 의 process() 진입부 source_channel='law_monitor' skip 분기가 사라져 매일 07:00 신규 법령 분할마다 26B legacy classify(8s) + 26B legacy summarize(10s) + 4B triage(1.5s) 전부 호출되고 있었다. 법령 분리 PR (stateless-churning-raccoon) 의 명제: "법령은 외부 source-of-truth + immutable + 자동 재수집 → 다른 수명주기" 와 일치하도록 process() 진입부에 skip 분기 복원. 최소 필드 (ai_domain='법령', ai_tags=['법령'], importance='medium') 만 세팅 후 return. queue_consumer 의 NEXT_STAGES['classify']=['embed','chunk'] 가 자동 chain 하므로 검색 영향 0. 법령 도메인 AI 산출물 가치 분석: - ai_summary: 법령 해석 환각 위험 (ASME/안전 엔지니어 사고 책임 소지) - ai_tldr/bullets: 이미 title 이 같은 정보 노출 — redundant - ai_inconsistencies: 공식 정합 문서라 100% false positive → 비용 (월 ~14분 26B 점유) 대비 가치 음수, skip 합당. tier_backfill.py 도 함께 수정: - DOMAIN_PRIORITY 에서 ('law', source_channel='law_monitor') 항목 제거 - safety 필터에 source_channel != 'law_monitor' 추가 (기존 ai_domain LIKE 'Industrial_Safety%' 매칭 안에 backfill 기 처리한 법령 doc 들이 잡혀 들어가는 case 차단) - 사유: skip 처리될 doc 을 enqueue 하면 야간마다 enqueue→skip→NULL→ enqueue 무한 루프 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 07:35:27 +09:00
Hyungi Ahn	8427ac886c	feat(memo): sync content ↔ memo_task_state on create/update + backfill script 본문에 `- [x]` 로 직접 입력된 체크 항목도 checked_at 가 기록되어 10초 후 자동 숨김 대상이 되도록 create_memo / update_memo 에 sync 로직 추가. - _sync_task_state_with_content: - [x] 에 checked_at 없으면 현재 시각으로 기록, - [ ] 또는 사라진 index 는 state 에서 정리 - scripts/backfill_memo_task_state.py: 배포 이전 기존 노트에 현재 시각 backfill (docker compose exec fastapi python /app/scripts/backfill_memo_task_state.py --apply)	2026-04-24 15:40:18 +09:00
Hyungi Ahn	a95294ff42	feat(ops): 야간 auto tier 백필 스케줄러 (PR-B 레거시 해소) 6720건 레거시 문서를 야간에 자동으로 tier triage + deep_summary 처리. app/workers/tier_backfill.py (신규): - APScheduler 30분 주기 트리거. KST 00:00~06:00 시간대만 실제 enqueue. - safety > law > manual 우선순위 25건씩 classify 큐 재투입. - classify 큐 40건 이상 쌓여있으면 MLX 부하 보호로 skip. - drive_sync / memo / news 는 제외 (plan 스코프 밖 또는 가치 낮음). - off-switch: settings.ai.tier_backfill.enabled = false 로 전면 중단 가능. app/main.py lifespan: - scheduler.add_job(tier_backfill_run, interval=30min, id='tier_backfill'). - AsyncIOScheduler 이미 timezone='Asia/Seoul' 로 설정돼 tier_backfill 내부의 zoneinfo('Asia/Seoul') 와 일치. 수치 예상: 야간 6시간 × 2회/시간 × 25건 = 150건/야간. 6720 / 150 = 약 45일이면 전체 레거시 소화. MLX 부하 제어가 가장 강한 관심 — R2 backlog guard 와 중복 안전장치. 운영 중 과부하 감지 시: config.yaml 에 `ai.tier_backfill.enabled: false` 만 넣으면 즉시 정지 (재시작 없이 스케줄러가 매번 체크). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 15:28:28 +09:00
Hyungi Ahn	814882a0fe	feat(ops): tier triage 레거시 백필 스크립트 PR-B B-1 배포 이전에 classify 된 6770건 레거시 문서에 대해 ai_tldr / ai_bullets / ai_detail_summary 등 tier 산출물을 채우기 위한 백필 도구. 사용: docker exec hyungi_document_server-fastapi-1 \ python /app/scripts/backfill_tier.py --domain safety --limit 50 --dry-run docker exec hyungi_document_server-fastapi-1 \ python /app/scripts/backfill_tier.py --domain safety --limit 50 --apply 도메인 필터: safety / law / manual / news / drive_sync / memo ORDER BY created_at DESC 로 최신 우선. ON CONFLICT DO NOTHING 이라 기존 pending/processing 행 있으면 중복 enqueue 방지. MLX 26B 단일 Semaphore 경로라 처리 속도 ~1건/분. 50건 ≈ 1시간. 대량 백필은 야간 분할 권장. 이번 세션 Industrial_Safety 50건이 첫 smoke 대상. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 13:30:31 +09:00
Hyungi Ahn	320c02fe29	fix(memo): bump migration number 161 → 163 (collision with PR-B 161/162) cherry-pick 시점에 main 이 PR-B B-2 (161_analyze_events_answerability + 162_analyze_events_answerability_idx) 까지 포함해 번호 충돌. fastapi 부팅 시 _parse_migration_files 가 "migration 버전 중복: 161_..." 로 RuntimeError. 163 로 재지정 (schema_migrations 의 기존 161/162 레코드는 그대로 유효).	2026-04-24 12:59:39 +09:00
Hyungi Ahn	9d344c87ea	feat(memo): auto-hide completed tasks after 10s with toggle 체크박스 체크 후 10초 경과 항목을 대시보드 핀 메모 / /memos 에서 자동 숨김, 메모 푸터 "완료 N개 보기" 버튼으로 토글. - migration 161: documents.memo_task_state JSONB — {"<idx>":{"checked_at":"ISO"}} - PATCH /memos/{id}/tasks/{task_index} 전용 엔드포인트: · SELECT FOR UPDATE 로 동시 토글 race 차단 · task_index drift 시 stale state 자동 정리 (400 대신 200) · AI 재처리/큐 enqueue 의도적 스킵 + memo_task_toggle_skip_ai 로그 - renderMemoHtml(taskStates, now) → 경과 항목에 memo-task-hidden 클래스 - Svelte 5 $effect cleanup 으로 setInterval 누수 방지	2026-04-24 12:56:55 +09:00
Hyungi Ahn	ebc37961e0	fix(memo): prevent title overwrite on checkbox patch 체크박스 토글 같은 {content}-only PATCH 에서 body.title==None 을 무조건 _auto_title(content)로 재생성해 제목이 체크박스 라인으로 덮어씌워지는 버그. Pydantic model_fields_set 으로 title 전송 여부를 구분해 PATCH semantics 정상화.	2026-04-24 12:56:51 +09:00
Hyungi Ahn	e2b32fe9b7	fix(ai): B-1 R2 risk_flag_requires_26b 를 hard escalate 로 승격 실측 발견 (safety 8건 재분류): - 10574 KRAS (safety_operational) → escalate=true (guard 전 pass) - 10568 JSA (safety_operational) → escalate=false suppressed=True - 10570 PPE (safety_operational) → escalate=false suppressed=True - 동일 도메인인데 4건 중 1건만 26B 처리. 같은 질의 종류 문서가 누구는 깊이 있고 누구는 짧음 → 사용자 관점 일관성 붕괴. 원인: risk_flag_requires_26b 가 soft escalate 분류 → R2 backlog guard 의 ratio 임계치(0.3) 에 걸림. 방금 classify 8건 enqueue 중 앞선 건들이 deep_summary 큐 채우자 뒤 건들이 전부 suppress. 수정: HARD_ESCALATE_REASONS 에 risk_flag_requires_26b 추가. safety/ health/chemical 등 도메인 정책 기반 escalate 는 절대 억제하지 않음. soft 영역은 여전히 남아있음: self_declare (4B 자가선언), deep_requested (recommend_deep_summary). 이 둘만 backlog guard 가 억제 대상. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 12:33:12 +09:00
Hyungi Ahn	93a687a51d	fix(ai): B-1 deep_summary 잘린 응답 field-level regex fallback _parse_outermost_json 도 열린 문자열 중간에 응답 끊기면 실패. 실전 MLX 응답이 entities_confirmed 내부 문자열에서 종료되는 패턴이라 detail/tldr/bullets/inconsistencies 전부 손실되던 이슈. _regex_extract_fields helper 추가: "key":"value" 쌍 개별 매칭으로 앞쪽 완결된 필드만이라도 건진다. detail 이 응답 앞부분에 있어 잘림 지점보다 앞이면 성공. 순서: 1. _parse_outermost_json (brace balance) 2. parse_json_response (기존 regex) 3. _regex_extract_fields (field-level fallback) entities_confirmed 제거 같은 프롬프트 수정은 PR-A 영역이라 건드리지 않고, PR-B 워커에서 방어. 근본 해결은 p3c_deep_summary 에서 불필요 필드 제거 또는 max_tokens 튜닝을 policy 소유자가 결정. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 11:27:04 +09:00
Hyungi Ahn	154cb1c8bd	fix(ai): B-1 deep_summary JSON parser 강건화 (최외곽 JSON 추출) 실측 버그 (doc 10573 산업안전보건법 deep 처리): - 26B MLX 응답 길이 1131자 (8192 token 한도 미도달) 에서 응답이 \`entities_confirmed\` 섹션 중간에 잘림. - parse_json_response 의 regex \`{[^{}](?:{[^{}]}[^{}])}\` 가 1단계 중첩까지만 매칭 + reversed 순회로 "가장 마지막 valid JSON" 우선 반환. - 결과적으로 entities_confirmed 내부 객체 (\`{"people":[],"orgs":[],...}\`) 가 파싱돼 detail/tldr/bullets 전부 손실 → ai_detail_summary 빈값. 수정: deep_summary_worker 에 \`_parse_outermost_json\` helper 추가. brace balance + 문자열 리터럴 인식으로 첫 '{' 부터 최외곽 '}' 까지 추출. 응답이 잘려 closure 없으면 남은 depth 만큼 '}' 보강 후 재시도 (partial 응답도 최대한 복구). parse_json_response 는 fallback. 이 수정 후 doc 10573 재처리 smoke 필요. entities_confirmed 필드는 정보창 UI 에 안 쓰므로 응답에서 제거하는 프롬프트 조정은 다음 라운드. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 11:25:01 +09:00
Hyungi Ahn	165b00f917	fix(ai): B-1 subject_domain 매칭 + RoutingDecision.escalate_to_26b 존중 실측 발견 (safety md 8건 tier triage 결과): 1. 분류 오분류: 본문에 "MSDS" 한 번 스쳐도 msds 도메인 매칭됨. 개인보호구/중대재해/밀폐공간/산업안전보건법 전부 msds 로 잘못 판정. 2. RoutingDecision 무시: PR-A domain_policy 의 high_impact=true 와 risk_flag_requires_26b 때문에 RoutingDecision.escalate_to_26b=True 이지만 내 _classify_escalation_reason 이 이걸 안 봐서 escalate=False 로 마감. safety/msds/hazard_specific 전부 4B 만 돌고 26B 정책 우회. 수정: - _match_subject_domain: (a) title 기반 매칭 우선 추가 — 파일명이 의도의 1차 시그널. (b) 본문 키워드는 2회 이상 등장해야 match (single-mention 오분류 방지). 우선순위도 재배열 (msds 맨 앞 → hazard/safety 뒤로). - _classify_escalation_reason: routing_decision 파라미터 추가. 4B 자체 판정 (long_context / low_confidence / self_declare / deep_requested) 이후 PR-A routing_decision.escalate_to_26b 가 True 이면 그 escalation_reasons 중 "high_impact" 외의 구체 사유(risk_flag_requires_26b 등) 를 채택. - _run_tier_triage: routing_decision 을 먼저 계산하여 _classify_escalation_reason 에 전달. _apply_triage_result 는 routing_decision 을 param 으로 받음 (중복 계산 제거). 이 변경 후 safety/msds/hazard_specific/incident_report 도메인 문서는 항상 26B escalate → deep_summary 큐. MLX 부하 증가하지만 plan 의도대로 정책 준수. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 11:18:59 +09:00
Hyungi Ahn	f872e4666f	fix(ai): B-1 envelope.from_stage PR-A enum 값으로 정정 doc 5260 (confidence 0.3 low_confidence 에스컬레이션) 실측에서 발견: EscalationEnvelope(from_stage='summary_triage') 가 PR-A ValidFromStage ({triage, summarize_short, advice_trigger, classify, night_sweep, ask_pre, unknown}) 에 없어 ValueError 발생 → 모든 deep_summary enqueue 가 envelope 생성 단계에서 터짐. tldr/bullets 기록은 envelope 실패 전에 완료되어 영향 없음 (try/except 가 classify 전체는 보호). P3a short summary 에서의 에스컬레이션 의미에 맞춰 'summarize_short' 로 변경. 내부 task 이름 (SUMMARY_TRIAGE_TASK = 'p3a_short_summary') 는 analyze_events. prompt_version 기록 전용이라 그대로 유지. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 11:04:47 +09:00
Hyungi Ahn	04f9eb6582	feat(ui): B-3 정보창 tier 자동 표시 + 대시보드 3종 카드 정보창 (AnalysisPanel): - doc prop 추가. doc.ai_tldr / ai_bullets / ai_detail_summary / ai_inconsistencies 있으면 버튼 없이 자동 렌더 (Section A). - tier 배지 (triage=흰 / deep=파랑) + tldr + bullets + detail 계층 카드. - inconsistencies kind 별 아이콘: version_drift=Calendar / procedure_conflict= GitBranch / source_conflict=Quote / missing_basis=HelpCircle. warning 톤. - 기존 "고급 분석" 버튼 (/documents/{id}/analyze 4층 응답) 은 Section B 로 유지. AIClassificationEditor: - 제목 옆 tier 배지 ("깊이" accent / "짧음" neutral) — ai_analysis_tier 값 기준. 대시보드 (B-3 3종 카드): - "에스컬레이션 비율 (24h)": escalated_to_26b / triage_total. 20% 초과 적색, 1% 미만 회색 (false negative 신호). reason 상위 4개 뱃지. - "triage JSON 건강도 (24h)": error_code='triage_json_invalid' / triage_total. 5% 초과 적색 (프롬프트/모델 이슈). - "Backlog Suppression (24h)": suppressed_reason IS NOT NULL / triage_total. 10% 초과 주황 (임계치 재조정 신호). Backend: - dashboard.py 에 TierHealthStack 모델 + analyze_events 24h 집계 쿼리. - escalation_by_reason (unnest(escalation_reasons)) + escalation_by_domain (subject_domain) 서브 집계. Frontend types: - stores/system.ts DashboardSummary 에 tier_health 옵셔널 필드 추가. UI 는 PR-A shadow 기간에도 tier_health.triage_total > 0 조건으로 조건부 표시 — 데이터가 없으면 카드 자체가 숨겨져 첫 삽입 시 UX 충격 0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:38:53 +09:00
Hyungi Ahn	34f79f84f2	feat(search): B-2 evidence LLM → 4B triage 전환 + answerability 컬럼 Plan 본래 의도: 근거 선별은 4B, 합성은 26B. - evidence_service: LLM 호출을 primary(26B MLX) → triage(4B Ollama) 로 전환. Ollama concurrent 가능하므로 get_mlx_gate() 제거. synthesis 는 여전히 llm_gate Semaphore(1) 경유로 MLX 보호. - prompt_version v3-evidence-triage bump (synthesis 프롬프트 자체는 v2-600char 그대로, evidence LLM 경로 변경을 분리 추적). - migrations 161/162: analyze_events 에 answerability / partial_basis / suggested_query_count 컬럼 + partial index. /ask 는 이미 ask_events 에 completeness (full/partial/insufficient) 기록 운영 중이므로, analyze_events 쪽은 향후 문서 분석에서 answerability 개념 도입 시 활용 예비. - telemetry record_analyze_event 에 answerability / partial_basis / suggested_query_count 파라미터 확장. 기존 /ask 3-state completeness 로직 (classifier_service + 7-tier gate) 은 그대로 유지 — 이미 Phase 3.5a 에서 완성된 상태. B-2 는 LLM 부하 재분배와 관측성 확장에 집중. MLX 부하 감소 효과: 이전엔 쿼리 1건당 evidence(26B) + synthesis(26B) 2번 MLX 호출. 이제는 evidence(4B Ollama) + synthesis(26B MLX) 로 MLX 호출 절반. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:33:32 +09:00
Hyungi Ahn	6fdc48e5b6	feat(ai): B-1 summary tier 분할 — triage(4B) + deep_summary(26B) PR-A policy 레이어를 재사용하여 classify_worker 에 tier triage 경로를 추가. Legacy ai_summary / ai_domain / ai_suggestion 은 유지 (회귀 0), tldr/bullets/ detail/inconsistencies 는 별도 필드로 분리. Migrations (156~160): - 156 documents: ai_tldr, ai_bullets, ai_detail_summary, ai_inconsistencies, ai_analysis_tier 5컬럼 - 157 process_stage 에 'deep_summary' ADD VALUE 단독 (Postgres 동일 트랜잭션 제약 회피) - 158 processing_queue.payload JSONB (envelope 전달) - 159 analyze_events 에 tier + suppressed_reason - 160 suppressed_reason partial index Models/ORM: - Document: 5컬럼 Mapped 추가 - ProcessingQueue: deep_summary enum 확장 + payload 필드, enqueue_stage 에 payload 옵션 - AnalyzeEvent: PR-A shadow 6컬럼 + PR-B tier/suppressed_reason Workers: - classify_worker: 기존 legacy 경로 뒤에 _run_tier_triage 추가. - _match_subject_domain(doc, text): source_channel + 본문 keywords + ai_domain prefix 로 PR-A policy 의 subject_domain 이름 결정 (category 매칭 금지). - R1 TriageOutput pydantic + JSON 깨짐 fallback (triage_json_invalid). - R2 _check_backlog_guard(): 30분 window ratio > threshold OR pending 초과면 soft escalate suppress. hard escalate 는 통과. - R3 _slice_text_ranges(): 260k 초과 시 head 120k + mid 20k + tail 120k 3조각. - escalate 시 EscalationEnvelope 구성 + {envelope, subject_domain} payload 로 deep_summary enqueue. - deep_summary_worker (신규): queue payload 에서 envelope + subject_domain 읽기 → render_26b("p3c_deep_summary", subject_domain) + MLX 호출 (llm_gate Semaphore(1) 경유) → ai_detail_summary + ai_inconsistencies 저장 + ai_analysis_tier='deep'. _filter_inconsistencies 로 허용 kind (version_drift / procedure_conflict / source_conflict / missing_basis) 만 통과 — 구매/계약 kind drop. - queue_consumer: workers dict 에 deep_summary 추가 + BATCH_SIZE=1. next_stages 는 건드리지 않음 — classify → embed/chunk 는 그대로, deep_summary 는 독립 체인. Telemetry: - record_analyze_event: subject_domain / risk_flags / escalation_reasons / confidence / policy_version / shadow_would_route_to / tier / escalated_to_26b / suppressed_reason 파라미터 확장. classify/deep worker 가 mode="summary_triage" 또는 "summary_deep" 로 기록. API: - DocumentResponse 에 ai_tldr / ai_bullets / ai_detail_summary / ai_inconsistencies / ai_analysis_tier 5필드 노출. Prompts: - classify.txt 에 DEPRECATED 주석만 추가 (파일 유지 — rollback 경로 보존). - PR-A 의 app/prompts/policy/p3a_short_summary.txt (4B) 와 p3c_deep_summary.txt (26B) 를 그대로 사용. 내 소유의 summary_triage.txt / summary_deep.txt 는 중복 이라 별도 커밋에서 제거하지 않고 바로 생성 전 삭제. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:22:40 +09:00
Hyungi Ahn	18d684b501	ops(infra): STT Mac mini 이전 + classifier 섹션 복원 (gemma4:e4b) - docker-compose.yml stt-service 를 profiles:[legacy] 로 이동. GPU 의 stt-service 는 더 이상 기동하지 않고, fastapi STT_ENDPOINT 가 Mac mini (기본 100.76.254.116:8804 Tailscale, MAC_MINI_HOST env 로 LAN IP 주입) 를 바라보도록 변경. 복원 필요 시 `docker compose --profile legacy up -d stt-service`. - config.yaml: classifier 섹션을 gemma4:e4b-it-q8_0 으로 복원. 이전 B-0 커밋이 classifier 를 주석 처리했는데, 실제로는 classifier_service 가 쓰고 있어 gate 유효. exaone 은 이미 제거됐으니 모델만 gemma4 로 통일. classifier_service 의 hasattr 체크는 유지되어 fallback 안전. D13 (STT 이전) drift 를 main 으로 승격. inventory 갱신은 B-3 마감 단계에서 3-tier + STT 경로 묶어서 일괄. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:08:00 +09:00
Hyungi Ahn	490bef1136	feat(ai): B-0 3-tier routing — triage/primary/fallback 슬롯 + AIClient - config.yaml: ai.models 에 triage (gemma4:e4b-it-q8_0, GPU Ollama, context_char_limit=120k, timeout 30s) 신규. primary (MLX gemma-4-26b) 는 에스컬레이션 전용 역할 명시. fallback 을 gemma4:e4b 로 통일 (exaone 제거 이미 반영). classifier/verifier 는 optional 유지, vision 은 optional 로 완화 (미사용 정리 준비). - core/config.py: AIConfig 에 triage 필드 추가, vision 은 Optional 로 전환. AIModelConfig.context_char_limit + DeepSummaryBacklogConfig (R2 backlog guard 임계치 ratio 0.3 / pending 5 / window 30min) 스키마 신설. load_settings 가 models.get("vision") graceful. - ai/client.py: call_triage / call_primary / call_fallback 3-tier 진입점 신규. primary 는 caller 가 get_mlx_gate() 블록 안에서 호출 해야 한다는 계약 docstring. classify/summarize 는 DEPRECATED 주석 만 추가, 기존 호출부 (eval runner 등) 를 위해 유지. PR-B B-0 Day 1. 기존 primary 경로 변경 없음 — 회귀 0 기대. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:05:24 +09:00
Hyungi Ahn	628d886cba	fix(policy): mount domain_policy.yaml into fastapi + multi-path loader 배포 검증 중 발견: domain_policy.yaml 이 repo root 에 있지만 fastapi 컨테이너의 build context 는 ./app 이라 COPY 가 포함하지 못함. 결과 load_policy() 가 FileNotFoundError. 1. docker-compose.yml: config.yaml 과 동일 패턴으로 읽기전용 bind mount - ./domain_policy.yaml:/app/domain_policy.yaml:ro 2. app/policy/loader.py: _resolve_path 에 4 개 후보 검색 추가 — cwd / /app / /app/.. / <this>.parent.parent.parent 순으로 파일 존재 확인. 첫 매칭 반환. 로컬/컨테이너/다른 배포 환경 모두 호환. CI: pytest tests/policy/ -q → 98 passed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:45:10 +09:00
Hyungi Ahn	99672292d3	fix(policy): use container-compatible imports (drop app. prefix) 프로덕션 컨테이너는 /app 을 cwd 로 실행하고 import 는 `from api...`, `from core...`, `from workers...` 처럼 무접두 스타일을 사용한다. PR-A 내부 import 가 `from app.policy...`, `from app.ai.envelope` 로 되어 있어서 컨테이너에서 ModuleNotFoundError 발생. 변경: - app/policy/.py: `from app.policy.X` → `from policy.X` - app/services/prompt_versions.py: lazy import 도 `from policy.prompt_render` - app/ai/envelope.py: 영향 없음 (내부 import 없음) - tests/policy/.py: 모두 `from policy.X` / `from ai.envelope` 로 통일 - tests/policy/conftest.py: 로컬 pytest 용 sys.path.insert(app/) 추가 (MacBook 에서 repo-root 기준 실행 시 app/ 를 package root 로 취급) CI: pytest tests/policy/ -q → 98 passed (로컬, 동일 결과) 프로덕션: docker exec fastapi python -c "from policy.loader import load_policy" → OK Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:42:24 +09:00
Hyungi Ahn	c9e8dd0ba1	fix(db): split migration 153 for asyncpg single-statement limit asyncpg exec_driver_sql 이 prepared statement 로 multiple commands 를 허용하지 않아 배포 시 PostgresSyntaxError: cannot insert multiple commands into a prepared statement 로 init_db() 실패. 153 를 단일 ALTER TABLE (10 ADD COLUMN) 로 축소하고 2 partial index 를 154/155 로 분리: - 153_analyze_events_shadow.sql: ALTER TABLE ADD COLUMN (단일 statement) - 154_analyze_events_shadow_idx_ts.sql: idx_analyze_events_shadow_ts - 155_analyze_events_policy_violation_idx.sql: idx_analyze_events_policy_violation 배포 test: GPU fastapi 컨테이너 재빌드 후 init_db 가 153/154/155 세 파일을 순차 적용 (asyncpg prepared statement 1 파일 1 문). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:38:40 +09:00
Hyungi Ahn	ba97766d45	feat(policy): INV-1~6 테스트 + loader/audit/envelope/shadow 검증 tests/policy/ 7개 테스트 파일 + conftest + __init__. 98 tests passed. 커버: - test_policy_loader_schema.py (9) — yaml 로드, cross-reference, unknown flag reject, invalid UI category reject, synthesis_directive 500 chars 초과 reject - test_self_declare_add_only.py (4) — INV-1 invariant 엄격 검증 - test_routing_decisions.py (27) — INV-2~6 + low_confidence + 도메인 × 시나리오 parametrize (9 도메인 x 기본 시나리오) - test_audit_patterns.py (11) — detection_patterns 양성/음성, 도메인 미스매치, 빈 텍스트 엣지 - test_envelope_contract.py (6) — JSON round-trip, invalid from_stage reject, tuple 강제 - test_prompt_render.py (16) — 모든 템플릿 렌더, placeholder 치환, policy_version deterministic/yaml-sensitive hash - test_shadow_logger_inmem.py (5) — record/clear/multiple/extra/ Protocol 호환 conftest.py: autouse _clear_policy_cache fixture — lru_cache 로 인한 테스트 간 오염 방지. policy fixture 는 repo root domain_policy.yaml 로드. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:49 +09:00
Hyungi Ahn	301867d0ee	feat(db): migration 153 — analyze_events shadow 컬럼 ALTER TABLE analyze_events ADD COLUMN IF NOT EXISTS 로 10개 shadow 컬럼: subject_domain, risk_flags[], high_impact_task, escalated_to_26b, escalation_reasons[], confidence, policy_violation, policy_violation_ids[], shadow_would_route_to, policy_version. + 2 partial index: - idx_analyze_events_shadow_ts (shadow_would_route_to IS NOT NULL) - idx_analyze_events_policy_violation (policy_violation=true) 전부 nullable, 기본값 NULL. 아무도 쓰지 않음 — PR-B 의 DBShadowLogger 가 writer 추가 예정. 번호 153: 152 는 `feat(category): law` 가 점유 (`e88640d`). BEGIN/COMMIT 없음 (CLAUDE.md: _run_migrations 단일 outer 트랜잭션). answerability / new_facts_count 는 PR-B 의 migration 154+ 가 소유. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:49 +09:00
Hyungi Ahn	23b8a555c2	feat(prompts): policy templates (p1~p6, 9 files) app/prompts/policy/*.txt — 4B/26B 정책 템플릿. {forbidden_block} / {subject_description} / {confidence_threshold} / {context_cap} placeholder 포함. 금지 규칙 하드코딩 0 건. 4B (7): p1_triage, p2_nas_rule, p3a_short_summary, p3b_entities, p4a_advice_trigger, p4b_retrieval, p6_night_sweep 26B (2): p3c_deep_summary, p4b_synthesis 각 템플릿 공통 구조: - [System] 역할 선언 + subject_description - forbidden_block (yaml 에서 도메인별 렌더) - 작업 규칙 - 출력 형식 (JSON only, escalate_to_26b 포함) - 에스컬레이션 기준 - [User] 실행시 치환 placeholder (이중 중괄호) render 호출은 PR-A 에서 아무도 하지 않음 — 자산 배치만. PR-B escalation_service 가 실제 worker 에서 render. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	b9cc7f0ae1	feat(policy): shadow Protocol + InMemoryShadowLogger ShadowLogger (runtime_checkable Protocol) — PR-B 가 DBShadowLogger 구현 시 준수해야 할 인터페이스. record_would_route(*, doc_id, decision, actual_model_used, prompt_version, policy_version, extra=None) → None. InMemoryShadowLogger — 테스트 전용 in-memory 구현. records/count/clear inspection helpers. Protocol 호환 (isinstance 통과). PR-B 책임: app/services/policy_shadow_writer.py::DBShadowLogger(ShadowLogger) 구현 — analyze_events 에 INSERT. DB write 실패 시 WARN 로그만, 본 파이프라인 중단 금지 (shadow 기간 제품 영향 0). plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	f51583f9d6	feat(policy): prompt_render + policy_version hash app/policy/prompt_render.py: - render_4b(task, subject) / render_26b(task, subject) — template + yaml excerpt 주입. {forbidden_block} / {subject_description} / {confidence_threshold} / {context_cap} placeholder 치환. - policy_version(task) → sha256(yaml_bytes + template_bytes)[:12]. deterministic — yaml 이나 template 이 바뀌면 hash 변경, analyze_events. policy_version 컬럼으로 drift 추적. - KNOWN_4B_TASKS / KNOWN_26B_TASKS — 잘못된 task 호출 ValueError. - 미정의 subject_domain 은 fallback_domain.description 사용. app/services/prompt_versions.py: - compute_policy_version(task) helper 추가. app.policy 지연 import 로 worker 경로에 정책 dependency 유입 방지 (런타임 격리). - 기존 ASK_PROMPT_VERSION / ANALYZE_PROMPT_VERSION 상수 미변경. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	5057c48ad3	feat(policy): audit — forbidden pattern detection check_4b_output_violations(text, subject_domain) → list[str]. Python re.search 기반 (Postgres regex 아님). forbidden_for_4b 에서 해당 subject 에 적용되는 rule 만 선택 후 detection_patterns 순회. 컴파일된 패턴 lru_cache 로 반복 호출 비용 감소. escalate_to_26b=False 인 event 에만 호출하여 policy_violation=true 기록 + under_escalation 재처리 후보로 포획. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	3314b44918	feat(policy): decide_routing + INV-1~6 invariants RoutingDecision frozen dataclass. 6 deterministic invariants (code-level HARD): - INV-1 self_declare_add_only: deterministic=True & self=False → high_impact 유지 - INV-2 risk_flag_requires_26b: any flag.requires_26b=True → 강제 escalate - INV-3 context_cap: content_chars > 120000 → long_context escalate - INV-4 multi_doc: evidence_doc_count >= 3 → multi_doc escalate + multi_doc_dependency flag - INV-5 risk_flags UNION merge: default + self_declared + derived 전부 합집합 - INV-6 fallback_domain: 미정의 subject → fallback_domain 적용 (routing None 방지) reason 상수 노출 (REASON_HIGH_IMPACT / REASON_RISK_FLAG / REASON_LOW_CONFIDENCE / REASON_LONG_CONTEXT / REASON_MULTI_DOC / REASON_FALLBACK_DOMAIN) — 테스트 + PR-B escalation_service 재사용. synthesis_directives 는 수집된 risk_flags 의 directive 만 자동 집계 (정렬 고정). plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	b401085518	feat(ai): EscalationEnvelope contract (4B→26B handoff) frozen dataclass with from_stage / escalation_reasons / risk_flags / distilled_context / original_pointers / synthesis_directives / user_intent / draft_hint. JSON round-trip (to_json/from_json). to_system_injection() 으로 26B system prompt 에 주입할 텍스트 블록 생성 (risk_flags + directives + distilled_context 순). from_stage 는 whitelist 검증 (triage/classify/summarize_short/advice_trigger/ night_sweep/ask_pre/unknown). tuple 타입 강제 (mutability 방지). PR-B 의 escalation_service 가 이 계약을 사용. PR-A 는 계약만 정의. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	d23ea48223	feat(policy): pydantic schema + yaml loader app/policy/schema.py — DomainPolicy, SubjectDomain, FallbackDomain, RiskFlag, ForbiddenRule, Escalation, Observability (pydantic v2, frozen). suggested_ui_category 는 실측 doc_category enum (document\|library\|news\|memo\| audio\|video\|law) 만 허용. synthesis_directive 500 chars 제한. cross-reference validator — default_risk_flags 가 미정의 flag 참조 시 ValidationError. app/policy/loader.py — load_policy(path) + functools.lru_cache. env POLICY_PATH override, read_policy_bytes() helper (policy_version hash 용). plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	fad73ba88c	feat(policy): domain_policy.yaml v1 (safety_health + news) PR-A 의 Single Source of Truth. subject_domains 9개 (safety_reference/ safety_operational/msds/hazard_specific/incident_report/health_record/ safety_video/news_item/news_digest_request) + fallback_domain + risk_flags 10개 + forbidden_for_4b 6 카테고리 + escalation 임계값 + observability. Axis 원칙 (feedback_category_vs_ai_domain_axis.md): - subject_domain 매칭 키 = source_channel/keywords/tags/ai_domain - documents.category 는 UI 축 (매칭 키로 사용 금지) - suggested_ui_category 는 OUTPUT 매핑 (분류 제안용) Scope: safety_health + news 만. 소설은 별도 정책으로 분리. plan: ~/.claude/plans/wise-gliding-hippo.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:34:48 +09:00
Hyungi Ahn	ddfcdbb68a	fix(documents): frontend 에 category URL param 지원 추가 `/documents?category=law` 같은 URL 이 프론트에서 무시되던 버그 — `+page.svelte` 의 filter state 에 `category` 가 빠져 있어 API 호출 시 `?category=` 가 서버로 전달 안 됐음. 결과적으로 default 목록 (news/law 만 제외한 전체) 이 반환됐다. Sidebar '법령 알림' 버튼 (`e88640d`) + API `category` 필터 (§§2A) 는 이미 반영됨 — 프론트 middleware 만 추가. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:20:21 +09:00
Hyungi Ahn	e88640d3d8	feat(category): law 카테고리 분리 — enum + backfill + classify skip - migrations/152: ALTER TYPE doc_category ADD VALUE 'law' (DDL only; PG16 단일-트랜잭션 제약상 backfill 은 별도) - models/document.py: Enum 에 'law' 추가 (7 활성 + 3 유보) - workers/law_monitor.py: Document(..., category='law') — 신규 유입부터 세팅 - workers/classify_worker.py: source_channel='law_monitor' early-return + 최소 필드 (ai_domain='법령', ai_tags=['법령'], importance='medium'). AI classify skip — 법령 구조 고정/외부 source of truth/자동 재수집 - scripts/backfill_category.py: law 분기 + WHERE re-target ((source_channel='law_monitor' AND category='document')) + VERIFY cat_law/law_source_count + fail 조건 - api/documents.py: default 목록 제외에 law_monitor 추가 (news 와 동일 패턴) - api/dashboard.py: documents count FILTER 에 law_monitor 제외 (category_counts.law 는 기존 GROUP BY category 로 자동 노출) - frontend/Sidebar.svelte: '법령 알림' 버튼 ?source=law_monitor → ?category=law (explicit category 경로가 default exclusion 을 skip) plan: ~/.claude/plans/stateless-churning-raccoon.md axis 원칙: category=UI 축, policy/telemetry=source_channel+ai_domain 축 (feedback_category_vs_ai_domain_axis.md) 배포 순서: push → GPU pull → compose up --build fastapi frontend → backfill --dry-run → --apply. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 09:14:56 +09:00
Hyungi Ahn	91d0fcd023	fix(ui): document-caddy trusted_proxies 설정 (mixed-content 해소) document-caddy 가 home-caddy 로부터 받은 X-Forwarded-Proto: https 를 신뢰하지 않고 incoming scheme (http) 로 덮어써 FastAPI 가 받은 proto 가 http 로 인식 → /api/documents 307 Location 헤더가 http:// 로 나가 HTTPS 페이지에서 mixed-content block. private_ranges 를 trusted_proxies 로 설정해 docker bridge 내부의 home-caddy 가 전달한 X-Forwarded-* 를 보존. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 07:29:45 +09:00
Hyungi Ahn	5cf7952b33	fix(ui): 카테고리 내비 상단 이동 + uvicorn proxy-headers - +layout.svelte 상단 nav 에 오디오/비디오 추가 (문서/자료실 옆, 카테고리 계열 그룹). Sidebar 는 §2 에서 추가했던 카테고리 블록 제거하고 기존 도메인 트리 전용으로 복구 — 상단 nav 와 중복되고, 사이드바가 카테고리 탐색 1차 진입점으로 적합하지 않다는 피드백 반영. - app/Dockerfile uvicorn 에 --proxy-headers --forwarded-allow-ips=* 추가. FastAPI 의 trailing-slash 307 리다이렉트가 X-Forwarded-Proto 를 무시해 Location 헤더를 http:// 로 생성 → HTTPS 페이지에서 mixed-content block (/video 에서 목격). home-caddy → document-caddy → fastapi 체인에서 scheme 복구. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 07:28:04 +09:00
Hyungi Ahn	a8e24ab114	fix(documents): accept-suggestion 항상 409 버그 + compose 127.0.0.1 바인딩 - accept-suggestion: documents.updated_at != expected stale 검사 제거. classify_worker 가 source_updated_at 을 pre-commit 값으로 저장하는데 SQLAlchemy onupdate 가 commit 에서 updated_at 을 bump → 항상 불일치 → 승인 영구 불가. payload 교체 검사 하나만으로 core race 는 막힘. 사용자 직접 편집 감지는 별도 user_updated_at 컬럼 도입 시 재논의. - docker-compose.yml: postgres/kordoc/fastapi/frontend 포트 127.0.0.1 바인딩. GPU 서버 로컬에만 있던 drift 를 main 으로 승격. UFW-Docker 우회 컨텍스트에서 불필요한 LAN 노출 축소. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 07:20:38 +09:00
Hyungi Ahn	31d76edba0	feat(dashboard): §4 — 카테고리/제안/queue lag 카드 + docs/categories.md frontend +page.svelte: - 4-card 메인 row 아래 새 row 추가: 자료실/오디오/비디오 (category_counts) + 자료실 제안 (library_pending_suggestions). 제안 ≥1 일 때 warning 색 + /library 링크. - buildPipelineRows 가 pipeline_status (24h 누적) + queue_lag (현재 시점) 머지. queue_lag.oldest_pending_age_sec 가 600초 초과면 stage 라벨 옆에 경과시간 표시. - STAGE_ORDER/LABEL 에 stt/thumbnail 추가 (§3 신규 stage 자동 커버). docs/categories.md (신규): - 6 활성 + 3 유보 카테고리 정의 + 저장 경로 + 처리 파이프 - 역할 분리 원칙 (category / user_tags @library/ / facet_doctype / ai_suggestion) - 업로드 경로 매트릭스 (web/NAS/collector/UI) - video 채널별 정책 표 (web 거부 vs NAS quarantine) - 업로드 한도 + error_code 7종 표 - orphan 임시파일 cleanup 정책 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 07:09:37 +09:00
Hyungi Ahn	cec464ae2d	fix(media): §3 ship-readiness — stt preload + healthcheck + queue enum + dashboard queue_lag stt: - services/stt/server.py: lazy → eager preload in FastAPI lifespan. STT_PRELOAD=0 으로 lazy 강제 가능 (개발/테스트). preload 실패해도 프로세스는 살아 있고 /ready false 로 남아 healthcheck 가 unhealthy 처리. - docker-compose.yml: healthcheck /health → /ready. /health 는 단순 liveness 라 모델 미적재 상태도 healthy 로 잡혀 운영 신호 부적합. queue ORM: - app/models/queue.py: process_stage enum 에 'stt'/'thumbnail' 추가 + create_type=False (migration 150/151 가 DB enum 확장 담당). 이게 없으면 stt_worker INSERT 시 SQLAlchemy 가 enum value 를 거부. dashboard 강화 (§4 선제, §3 신규 stage 까지 자동 커버): - app/api/dashboard.py: category_counts + library_pending_suggestions + queue_lag (stage 별 pending/processing/failed + oldest_pending_age_sec). - frontend/src/lib/stores/system.ts: QueueLag 타입 + DashboardSummary 확장. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 07:04:52 +09:00
Hyungi Ahn	8f25d396df	feat(upload): §4-독립 — error_code 체계 + .uploading orphan cleanup + 진행률/abort UX plan: ~/.claude/plans/luminous-sprouting-hamster.md §4 (1GB/stt/dashboard 외 독립 항목) backend: - _upload_error(status, code, msg) 헬퍼 정의 (§3 가 호출만 추가했던 누락 수정). detail = {error_code, message} — 프론트가 error_code 로 분기. - upload_document 의 모든 HTTPException 을 _upload_error 로 전환: body_too_large / invalid_input / empty_file / unsupported_codec / internal - ClientDisconnect → 499 network_abort + 임시파일 정리. asyncio.TimeoutError → 408 upload_timeout. - 쓰기 중 .uploading 임시명 → 완료 후 staging.replace(target) atomic rename. → 프로세스 크래시 잔존물은 cleanup_orphan_uploads 가 수거. - file_watcher SKIP_EXTENSIONS 에 .uploading 추가 (오해 픽업 방지). cleanup scheduler: - workers/upload_cleanup.py 신규. 10분 주기로 Inbox 하위 *.uploading 중 mtime > orphan_max_age_sec(3600) 인 파일 삭제. - 최근 3회 (≈30분) 누적 삭제 수가 cleanup_warn_threshold(10) 이상이면 WARNING 로그. in-memory deque (재시작 시 리셋) — 집요한 이슈만 잡는 목적. - core/config.py UploadConfig 에 두 임계치 필드 (defaults — config.yaml override 무관). frontend: - api.ts: ApiError 에 optional errorCode/errorMessage 필드 (detail string 유지로 기존 5+ 소비자 호환). parseDetail() 가 {error_code, message} 객체 응답을 풀어 정규화. uploadFile(path, formData, {signal, onProgress}) XHR 헬퍼 신규 (fetch() 가 upload progress 미지원이라 XHR). 401 refresh 1회 정책 동일. - UploadDropzone.svelte 재작성: 진행률 바, 파일별/전체 abort 버튼, 페이지 이탈 beforeunload 경고, errorCode 별 토스트 메시지 분기 (7 코드 — body_too_large / upload_timeout / network_abort / empty_file / invalid_input / unsupported_codec / internal). 컴포넌트 unmount 시 진행 중 업로드 abort. 보류: - max_bytes 1GB 상향 + Caddyfile 1100MB (별도 결정으로 100MB 유지) - /dashboard 카테고리 카드 (별도 plan) - docs/categories.md (§1-3 정의 안착 후) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 06:57:02 +09:00
Hyungi Ahn	1e2c004dd4	feat(media): §3 audio STT + video 재생 인프라 plan: ~/.claude/plans/luminous-sprouting-hamster.md §3 스키마: - migrations/147_audio_segments_table.sql: audio_segments (STT 타임스탬프 세그먼트) - migrations/148_audio_segments_idx.sql: (document_id, start_s) idx - migrations/149_document_media_cols.sql: documents.thumbnail_path + needs_conversion - migrations/150_queue_stage_stt.sql: process_stage += 'stt' - migrations/151_queue_stage_thumbnail.sql: process_stage += 'thumbnail' - app/models/audio_segment.py, document.py (thumbnail_path/needs_conversion) 서비스: - services/stt/{Dockerfile, requirements.txt, server.py} — faster-whisper large-v3 GPU 컨테이너. /transcribe (filePath/langs/beamSize) + /health + /ready (cuda device_count + model_loaded). NFC/NFD 경로 resolver (OCR 교훈). - docker-compose.yml: stt-service 추가 (GPU 1 예약, :3300, NAS ro mount, stt_models volume, start_period 300s), fastapi env 에 STT_ENDPOINT. 파이프라인 (의존 §1 category): - app/workers/stt_worker.py 신규: stage='stt' pickup → STT_ENDPOINT 호출 → extracted_text + audio_segments 저장. Timeout 30분. - app/workers/thumbnail_worker.py 신규: ffmpeg 50% 지점 1장 → PKM/Videos/.thumbs/{id}.jpg + thumbnail_path 세팅. needs_conversion=true 는 skip. - app/workers/file_watcher.py 확장: PKM/{Inbox, Recordings, Videos} 스캔. 확장자→category, audio→stage=stt, video .mp4/.webm→ stage=thumbnail, video .mov/.mkv/.avi→needs_conversion=true + stage 없음. settings.roon_library_path prefix skip. - app/workers/queue_consumer.py 확장: stt + thumbnail workers 등록, BATCH_SIZE(stt=1, thumbnail=3), next_stages 에 stt→[classify] 추가 (audio 는 extract 건너뜀). - app/Dockerfile: ffmpeg 추가 (썸네일 subprocess 용). API (의존 §1): - /api/audio/{id}/segments — AudioSegment ORDER BY start_s - /api/video/{id}/thumbnail — thumbnail_path FileResponse (쿼리 토큰) - /api/documents/{id}/file: media_types 에 audio/video mime 포함 (§2 커밋에 이미 포함). Starlette FileResponse 가 Range 자동. - upload_document: .mov/.mkv/.avi 웹 업로드 거부 (error_code unsupported_codec). NAS 드롭은 file_watcher 가 quarantine 수용. 프론트: - AudioPlayer.svelte: HTML5 audio + 전사 세그먼트 sticky 패널 + 줄 클릭 seek. activeIdx 하이라이트. - VideoPlayer.svelte: HTML5 video direct play + needs_conversion 안내 카드. poster 는 thumbnail endpoint. - /audio (목록 grid) + /audio/[id] (플레이어) - /video (썸네일 grid + 변환 필요 배지) + /video/[id] (플레이어) - Sidebar.svelte: Mic/Film 아이콘 + audio/video 네비 활성, count 배지 (§2 /stats/category-counts 재사용). 설정: - app/core/config.py: stt_endpoint + roon_library_path. DoD 배포 후 smoke: /ready cuda:true, 회의 mp3 transcribe, audio extract 없이 classify 진행(queue 회귀), /audio 재생, .mp4 재생, .mov 웹 400, .mov NAS quarantine, Sidebar 네비 + count. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 06:47:36 +09:00
Hyungi Ahn	aceb54e586	fix(migrations): 143 asyncpg multi-statement 분리 asyncpg prepared statement 는 single-command 만 지원 (core/database.py exec_driver_sql 경로). §1 의 143_category.sql 이 4 statement (TYPE + ALTER + INDEX×2) 였어서 fastapi 부팅 시 asyncpg.PostgresSyntaxError "cannot insert multiple commands into a prepared statement" 로 실패 → 컨테이너 restart 루프. 143 을 4 개 파일로 분리: 143: CREATE TYPE doc_category 144: ALTER TABLE documents ADD category / ai_suggestion 145: CREATE INDEX idx_documents_category 146: CREATE INDEX idx_documents_has_suggestion (partial) DB 상태는 깨끗 (migration 143 이 부분 적용 안 됨 — asyncpg 가 batch 자체를 reject). schema_migrations 에 143 도 미기록이라 재실행 안전. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 15:46:00 +09:00
Hyungi Ahn	a93d1689d8	feat(documents): §2 카테고리 전용 페이지 + 승인 UI plan: ~/.claude/plans/luminous-sprouting-hamster.md §2 - GET /api/documents/stats/category-counts — Sidebar/Dashboard 용 카테고리별 문서 건수 + library_pending_suggestions - DocumentResponse 에 category / ai_suggestion 필드 노출 (§1 과 동일 수정, rebase 시 합쳐짐) - SuggestionReview.svelte 신규 — ai_suggestion.proposed_category='library' 제안 카드 리스트. 단건 승인/반려 + 체크박스 대량 승인. 409 stale 시 warning toast + 자동 refetch - /library 상단에 SuggestionReview 배치 (자료실 + 승인 대기함 겸). 승인/반려 후 tree/docs/facet 재조회 - Sidebar 재구성: 카테고리 내비(문서/자료실/뉴스/메모/검색) + 자료실 pending 배지. /api/documents/stats/category-counts 바인딩. audio/video 자리는 §3 주석 예약 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 15:36:22 +09:00
Hyungi Ahn	8fdea88676	feat(documents): §1 category enum + ai_suggestion 승인 파이프 plan: ~/.claude/plans/luminous-sprouting-hamster.md §1 - migrations/143_category.sql: doc_category enum (6 활성 + 3 유보) + documents.category + documents.ai_suggestion JSONB + 2 idx. - app/models/document.py: category (Enum, create_type=False), ai_suggestion (JSONB). - app/prompts/classify.txt: document_type enum 에 7 실무 doctype 추가 (발주서/세금계산서/명세표/도면/증명서/계획서/시방서) + facet_doctype 필드 directive. - config.yaml: document_types 에 7 항목 추가 (worker 검증 통과). - app/workers/classify_worker.py: FACET_DOCTYPES / LIBRARY_SUGGESTION_DOCTYPES 상수, facet_doctype 파싱(기존값 미덮어씀), 발주서/세금계산서/명세표 감지 시 ai_suggestion={proposed_category=library, proposed_path=@library/ 거래/{YYYY}/{doctype}, source_updated_at=doc.updated_at.isoformat(), ...}. category / user_tags 자동 전이 금지 (suggestion-only). - app/api/documents.py: · DocumentResponse 에 category / ai_suggestion 노출 · GET /documents ?category=<cat> / ?has_suggestion / ?proposed_category (category 지정 시 기본 news/memo 제외 해제 — §2 승인 UI 계약) · GET /documents/library 를 Document.category=='library' 기반으로 재구현 (path subquery 는 user_tags 유지 — 분류 내부 서가 경로) · POST /documents/{id}/accept-suggestion — FOR UPDATE + idempotent no-op + dual 409 stale (payload source_updated_at / documents.updated_at) + user_tags idempotent append · DELETE /documents/{id}/suggestion — idempotent, stale 검사 없음 - scripts/backfill_category.py: dry-run / apply. 매핑(news/memo/@library/else) + 3-way 상대 검증 (all_rows==categorized, uncategorized==0, cat_library==has_library_tag — 자동 전이 금지 정책 검증). 남은 DoD (원격 배포 후): docker compose up → migration 143 적용 → backfill apply → smoke (drive_sync 발주서 업로드 suggestion 생성 / category 유지, accept-suggestion idempotency + 409 stale 두 벡터, /documents?category=library == /documents/library 건수 일치). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 15:32:01 +09:00
Hyungi Ahn	e861784c86	fix(ocr): align torch/transformers with native venv (0.17.1 호환 확인된 조합) 이전 base image (pytorch/pytorch:2.5.1-cuda12.4) 가 surya-ocr 0.17.1 설치 시 torch 2.11.0 (PyPI CPU wheel) 로 업그레이드되지만 torchvision 0.20.1+cu124 는 유지돼 ABI 불일치 (torchvision::nms does not exist) → OCR 전체 실패. native /opt/surya-ocr/venv 에서 검증된 조합으로 복제: - python:3.12-slim base - torch 2.11.0+cu126 / torchvision 0.26.0+cu126 (PyTorch cu126 index 고정) - transformers 4.57.6 (5.x 는 surya detection.processor import 에서 실패) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 13:59:15 +09:00
Hyungi Ahn	f8f72ceae2	fix(ocr): Surya 0.17 API + NFC/NFD path normalize - services/ocr/server.py: surya 0.17.x predictors 기반으로 재작성 (구 `from surya.ocr import run_ocr` 제거됨 → import error → 빈 텍스트 반환) - NFC(DB 경로) vs NFD(NFS 파일시스템) 한글 정규화 mismatch 보정 - surya-ocr 버전 0.17.1 고정 (0.6~1.0 범위는 breaking change 노출) - AIClient.ocr() NotImplementedError 제거 (호출처 0건, extract_worker 가 ocr-service HTTP 호출을 직접 사용) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 13:52:19 +09:00
Hyungi Ahn	51a6f7c9af	feat(eval): 발주건 단위 baseline 평가 경로 추가 - run_eval.py: --queries-order / --order-groups / --output-order / --debug 옵션 추가. 기존 legacy CSV 스키마/값 불변 (출력 소비자 보호). - Tier 1A/1B/2 지표 구현: cross_format_link_success (top-10 공식 + top-5 보조, eligible/success 분수), top_5_document_match (guardrail + 절대 건수), manual_refind_flag (v0 heuristic), chunk_idx_stddev, range/page_citation_available capability flags. - order_groups.yaml: 발주건 3건 매핑 (TKP-26-0114/0132/0112, 10 docs). - queries_order_baseline.yaml: 12개 질문 (A:4 B:4 C:3 D:1). plan: ~/.claude/plans/merry-yawning-owl.md	2026-04-20 15:04:39 +09:00

1 2 3 4 5 ...

340 Commits