hyungi_document_server

Author	SHA1	Message	Date
hyungi	3092e3009d	feat(eval): Phase 2A Diagnose Phase 3+4: dispatcher + 3 measurements + decision (H3, keep bge-m3) phase-2a-embedding-diagnose.md v4 § 6 (dispatcher) + § 7 Phase 3 (51-case measurement) + § 7 Phase 4 (decision) Round 2 review: round-2-review-mighty-starfish.md (R2-2 + R2-B1 pair invariant + slug-based resolve) Code changes: - app/services/search/retrieval_service.py: - CANDIDATE_BACKEND_MAP allowlist (baseline / cand_me5_large_inst / cand_snowflake_l_v2) - _resolve_backend(slug) → docs_table/chunks_table/embed_endpoint or None - _embed_query_via_tei(): calls the candidate TEI endpoint (no cache) - _VALID_DOCS_TABLE + _VALID_CHUNKS_TABLE regex (R2-B1 two-stage gate) - _search_vector_docs / _search_vector_chunks: docs_table/chunks_table + snapshot__id_max parameters - search_vector + search_vector_multilingual: embedding_backend + snapshot__id_max parameters + dispatch log - app/services/search/search_pipeline.py: run_search() signature + threading through the 4 search_vector* calls - app/api/search.py: 3 Query parameters + ValueError → HTTP 400 (response includes the allowed list) - tests/search_eval/run_eval.py: --embedding-backend + --snapshot-doc-id-max + --snapshot-chunk-id-max + call_search/call_search_full/evaluate threading + threading through the 3 asyncio.run calls in main Measurement artifacts (51 cases, scored=46, failure=5): - reports/v0_2_phase2a_baseline_snapshot_2026-05-23.csv (production path with the snapshot filter applied) - reports/v0_2_phase2a_me5_large_inst_2026-05-23.csv - reports/v0_2_phase2a_snowflake_l_v2_2026-05-23.csv - tests/search_eval/baselines/v0_2_phase2a_{baseline_snapshot,me5_large_inst,snowflake_l_v2}_2026-05-23.json (3 files) Results: \| Candidate \| NDCG \| Δ vs baseline \| mixed \| korean_only \| p50 ms \| \|------------------------------------\|-----:\|--------------:\|------:\|------------:\|-------:\| \| bge-m3 (baseline snapshot) \| 0.659\| - \| 0.39 \| 0.51 \| 464 \| \| cand_me5_large_inst \| 0.477\| -0.182 \| 0.17 \| 0.47 \| 194 \| \| cand_snowflake_l_v2 \| 0.616\| -0.043 \| 0.35 \| 0.52 \| 254 \| Decision (H3): keep bge-m3. Both candidates are net regressions. 
- mE5-large-instruct: regression in every category (-0.182). The missing prefix is an uncontrolled variable; candidate for a separate PR, PR-2A-mE5-Prefix-Retry. - snowflake_l_v2: mild regression (-0.043). korean_only shows a slight +0.01 improvement signal. - To address the korean_only/mixed weakness, Phase 2B (Reranker) or Phase 2Q (Query rewrite) is recommended. Decision report: reports/phase_2a_embedding_decision_2026-05-23.md (includes § 1~8; all 16 closure-gate items PASS). Follow-up PR backlog: - PR-2A-mE5-Prefix-Retry (separate PR) - PR-2A-Extended-Bge-Mgemma2 (separate PR, v3 decision) - PR-2A-Cloud-Embedding-Scaffold-1 (Cohere/Voyage scaffold-only, optional) - PR-Search-Query-Rewrite-1 (Phase 2Q) - PR-Search-Reranker-V2-Diagnose (Phase 2B) - PR-2A-Chunks-Cand-Cleanup-1 (DROP the cand tables after 1 week) Production impact: - documents / document_chunks column/row changes: 0 - config.yaml changes: 0 (ollama bge-m3 unchanged) - the added endpoint behavior is opt-in via query parameter (when unspecified, zero regression on the production path) - 4 smoke tests PASS (baseline / baseline+snapshot / cand_me5 / cand_invalid → HTTP 400) - dispatch log recording verified (snapshot_doc/chunk_id_max recorded) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 06:55:13 +00:00
hyungi	5cb8d04b50	feat(ai): config-driven sampling profile: triage T=0, primary T=0.3 top_p=0.9 P1 of family-adaptive-bengio (Mac mini 4-lever bundle). AIModelConfig: temperature/top_p Optional fields (None = server default). The OpenAI/MLX branch of _request conditionally inserts the sampling arguments into the payload. config.yaml ai.models.triage.temperature=0.0 (deterministic) / primary temperature=0.3 top_p=0.9 (summary creativity). Not applied to the fallback (Anthropic) branch; that belongs to a separate plan. No changes to caller code. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 06:37:46 +00:00
hyungi	bcf644f893	refactor(search): /api/search/ask dispatcher route via llm-router PR-2 of DS AI routing policy (2026-05-23, see plan ~/.claude/plans/document-server-ai-cheeky-reddy.md + memory project_document_server_ai_routing_policy). Every DS backend call now goes through llm-router :8890 as the single path. Policy alignment: - New RouterBackend (services/llm/backends.py): per-alias router POST + requires_gate branch (only mac-mini-default is protected by llm_gate FOREGROUND). - Existing GemmaMacMiniBackend + QwenMacBookBackend are kept as legacy (DS_BACKENDS_VIA_ROUTER=false rollback safety only). To be retired after 1 week in a separate cleanup PR (PR-DS-Backends-Legacy-Cleanup-1). - get_backend factory is dual-path (env flag) and backward-compatible (gemma-macmini alias → mapped to mac-mini-default). - search.py:457 Query pattern extended: adds mac-mini-default\|claude-cloud\|auto. In /ask/react, isinstance(QwenMacBookBackend) → hasattr duck-typing (both RouterBackend and Legacy implement generate_with_tools). - New router_url in SearchAskBackendConfig (env LLM_ROUTER_URL or hardcoded MVP default http://100.76.254.116:8890). - Added LLM_ROUTER_URL + DS_BACKENDS_VIA_ROUTER to the fastapi env in docker-compose.yml. The path through AIClient (_call_chat, call_triage, call_primary, call_fallback) is left to a separate PR (PR-AIClient-Router-Migration-1): MVP scope C adopted to minimize regression risk. Closure (immediate fixture/matrix): - factory smoke over 6 aliases (None/mac-mini-default/gemma-macmini/ qwen-macbook/claude-cloud/auto) + 1 invalid (nonsense → ValueError). - 3 live cases: mac-mini-default 200 \"pong! 🏓\" + qwen-macbook cold 502 upstream_502_primary=ConnectError + claude-cloud 503 provider_not_configured. - 0 silent fallbacks + 0 direct M5/Mac mini sockets (only RouterBackend calls the router). Backup: ~/.local/share/ds-routing-pr2-backups/20260523/ (backends.py + config.py + search.py + docker-compose.yml). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 03:41:29 +00:00
hyungi	51c3f6df10	feat(search): /ask/react endpoint with Qwen native tool calling ReAct loop PR-DocSrv-Ask-ToolCalling-ReAct-1: introduces a ReAct loop using the native tool calling of Qwen3.6-27B-8bit. The existing /api/search/ask is untouched. Zero file-level conflicts with track B (frontend /ask SSE): the line diff of ask() in search.py is 0, purely additive. Core invariants: - separate endpoint /api/search/ask/react (qwen-macbook only, implicit opt-in) - when the MacBook is unavailable: HTTP 503 + error_reason=macbook_unavailable. No automatic fallback to Gemma (extension of Correction 4) G0 (hard gate before implementation, plan b-velvety-hare.md): - G0-1 fixture (tests/fixtures/qwen_tool_call_response.json): a captured real mlx-vlm response. shape = compatible with the OpenAI standard (choices[0].message.tool_calls + function.arguments JSON string). generate_with_tools() is implemented against this shape. - G0-2 counter semantics: max_tool_rounds=2 + max_llm_calls=3 + search_exec_max=2. The last LLM call forces a final answer with tool_choice="none" + a system instruction. - G0-3 trace exposure: debug_trace=null in the default response. Filled only when debug=true. Rounds are always recorded in the server log. backends.py (193 → 261 lines): - New method QwenMacBookBackend.generate_with_tools(messages, tools, tool_choice). The existing generate() is untouched. BackendUnavailable handling is the same. 
react_loop.py new (275 lines): - agentic_ask_loop(session, query, *, backend, max_tool_rounds, debug) - calls run_search inside a tool round, dedups results by id, forces the final round, sets partial=True when the final content is empty search.py (+82 lines): - POST /api/search/ask/react + AskReactRequest/Response schema - BackendUnavailable → JSONResponse(503, error_reason=macbook_unavailable) config.yaml + config.py: - search.ask.react: { enabled, max_tool_rounds=2, search_tool_limit=5, search_tool_mode=hybrid } tests (566 lines; 18 new + 23 regression, all PASS): - test_react_loop.py, 13 tests: G0-1 fixture shape / G0-2 counter cap / G0-3 trace exposure / BackendUnavailable propagation / sources dedup - test_search_ask_react_endpoint.py, 5 tests: 503 + 0 run_search calls / normal 200 / trace exposed with debug=true / max rounds partial - regression (test_ask_eval_auth 9 + test_search_ask_macbook_503 5 + test_backend_dispatcher 9) all PASS Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-22 13:43:47 +00:00
hyungi	a7b8f15870	feat(search): /ask backend dispatcher (qwen-macbook opt-in, no silent fallback) PR-MacBook-RAG-Backend-1: entry point for explicit backend selection in /api/search/ask. Core invariants (Correction 4): - backend unspecified = Gemma Mac mini default, zero change to the response contract - only an explicit opt-in backend="qwen-macbook" calls mlx-vlm.server on the MacBook M5 Max - when the MacBook is unavailable: HTTP 503 + error_reason=macbook_unavailable - automatic fallback is strictly forbidden: 0 calls to Gemma backend.generate() on the failure path backend dispatcher (services/llm/): - BackendBase / GemmaMacMiniBackend / QwenMacBookBackend / BackendUnavailable - the Qwen backend does not occupy the Mac mini llm_gate and uses a separate Semaphore(1); the permanent single-inference rule in the llm_gate docstring is now explicitly scoped to a single endpoint - httpx Connect/Read/Pool/Timeout/5xx → BackendUnavailable, 4xx is propagated synthesis_service.py: - added backend argument, new status="backend_unavailable" - cache key includes backend_name (prevents qwen ↔ gemma cache collisions) config: - search.ask.backend.{macmini_url, macbook_url, macbook_model, timeout_connect_s=1, timeout_read_s=30} - MacBook endpoint = http://100.118.112.84:8810 (M5 Max Tailscale bind) tests (14 new): - tests/services/test_backend_dispatcher.py (9): dispatcher consistency + Qwen generate path (mock 200 / dead port / 5xx / 4xx) + cache identity - tests/api/test_search_ask_macbook_503.py (5): the core invariant of Correction 4. gemma.generate.assert_not_called() when backend=qwen-macbook is unavailable Zero regressions in the existing ask path (all 85 tests PASS, including the 9 in test_ask_eval_auth). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-22 13:10:44 +00:00
Hyungi Ahn	eae1f48d62	feat(worker-pool): Registry-1C cap 1MB + deterministic compaction User decision 2026-05-19: the 100KB cap is too small for the 1.36MB of 7-day production data → raising the cap alone risks raw payload bloat. Apply a 1MB cap together with payload compaction. fetch_recap_context() changes: - memo payload item fields reduced to id/title/ai_tldr/ai_event_kind/created_at (5 fields) (ai_bullets/file_type/source_channel/category/extracted_text etc. excluded) - memo top-N = RECAP_MEMO_TOP_N env (default 200); the excess goes into the aggregate - aggregate = memos_by_day + memos_by_kind + omitted_memos - payload_compacted flag = whether the aggregate fallback was triggered - events stay raw (typically zero to a few in 7-day production data) internal_worker.py: - PAYLOAD_MAX_BYTES → _payload_max_bytes() env override (WORKER_RECAP_PAYLOAD_MAX_BYTES default 1_000_000) - payload_compacted / omitted_memos exposed in JobsRecapResponse - 413 detail states "after compaction" + points to tuning RECAP_MEMO_TOP_N 3 new tests + the existing endpoint 413 test updated: - 700 memos → 200 kept + 500 omitted + compacted=true + < 1MB - 10 memos → compacted=false + omitted=0 - abnormally large title (still over the cap after compaction) → 413 preserved Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 12:55:51 +09:00
Hyungi Ahn	0ea72c1aa6	feat(worker-pool): Registry-1C recap context + /jobs/recap + 100KB guard - app/services/worker_recap_context.py: fetch_recap_context(user_id, days) JOINs documents with file_type='note' over 7d (single-user invariant) + events over 7d (user_id match + cancelled excluded). timezone Asia/Seoul. - /internal/worker/jobs/recap POST: regular user JWT auth + context assembly + worker_jobs INSERT. job_type='recap' + payload JSONB. - payload 100KB guard: 413 when the JSON serialization exceeds 100_000 bytes. - Regression risk 0: the select clauses of the memos/events API are untouched, read-only queries only. worker-pool-policy §B.2 invariants preserved: ProcessingQueue unchanged, zero changes to production automatic branching, zero canonical promotes (worker_jobs.payload JSONB only). All 4 Notebook-Pilot-1 entry conditions can now be met: manual recap E2E / payload <100KB guard / residue 0 / permission separation 403. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 12:44:07 +09:00
Hyungi Ahn	f60d6e52fc	feat(worker-pool): Registry-1B enable Pull (auth + worker_jobs + 5 endpoints) Completes the 1B area of worker-pool-policy §B. On top of the 1A scaffold (mig 270~274 + 503 stub): - mig 275/276: worker_jobs (status CHECK + user_id=owner) + pending partial index - create_laptop_worker_bot_token + require_worker_user dependency (same shape as voice-memo) - real implementation of the 5 endpoints /internal/worker/{register,heartbeat,claim,result,drain} - /claim uses FOR UPDATE SKIP LOCKED + 204 with an empty body - /result verifies ownership (worker_id match, 404) + retries failed jobs (attempts/max) - on explicit failure, request.result is ignored (DB result stays NULL) - 22 tests across 7 files The 5 invariants of policy §B.2 are preserved: zero changes to the voice-memo wrapper, drain advisory, result raw JSONB, ProcessingQueue unchanged, zero changes to production automatic branching. Consumers (recap context + /jobs/recap + payload 100KB guard) = Registry-1C area. stale recovery / laptop client / canonical promote = Notebook-Pilot-1 area. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 08:54:07 +09:00
hyungi	acd29b963e	ops(triage): event_kind_hint diagnostic logging cleanup (PR-4B Apply shelved permanently) After chore-memo-NULL-backfill confirmed H1 (historical artifact) in 6/6 cases, the Apply PR is shelved permanently. Removes the 8-line logger.info block from `406b810` (zero behavior change; the diagnostic data is no longer needed). backup: app/workers/classify_worker.py.pre-eventkind-cleanup (7-day safety net, until ~2026-05-25) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 11:27:29 +00:00
Hyungi Ahn	bbd92a840a	feat(worker-pool): Registry-1A scaffold: worker_capabilities/heartbeats + /internal/worker/* 5 endpoint 503 stub PR-Worker-Pool-Registry-1A (scaffold only, no runtime activation). New: - migrations/270~274 (one statement per file enforced): worker_capabilities + 2 idx + worker_heartbeats + 1 idx - app/models/worker_pool.py: WorkerCapability + WorkerHeartbeat ORM (queue.py pattern) - app/api/internal_worker.py: all 5 endpoints use _stub_503(): register/heartbeat/claim/result/drain - tests/test_internal_worker_stub.py: 503 response smoke test (inline ASGI client, no DB dependency) Modified: - app/main.py: one line each for import + include_router (prefix=/internal/worker, consistent with internal_study) scaffold-first + phase-gate-material-first enforced (worker-pool-policy §1, §12): - 0 auth dependencies (JWT + require_worker_user come in 1B) - 0 ProcessingQueue changes (direction b: separate worker_jobs table = 1B) - 0 LLM calls / 0 canonical DB changes / 0 production automatic branching Zero regressions (1-week safety net = app/main.py.pre-registry-1a.20260518). plan: ~/.claude/plans/floofy-exploring-mitten.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 20:24:59 +09:00
hyungi	406b810e28	ops(triage): PR-4B-Diagnose-EventKindHint-Layer-A: diagnostic logging (no behavior change) Layer-A Diagnose only. Captures the raw/normalized/in_valid/confidence values of event_kind_hint just before classify_worker.py:691 (5 lines of logger.info inserted, lazy formatting + %r repr). Used to pin down the specific root cause of the guard not passing (A1 field missing / A2 empty string / A3 invalid enum). The specific fix (default note / enum mapping / stronger prompt) goes into a separate PR-4B-Fix-EventKindHint-Apply. Logging cleanup (info → DEBUG or removal) is folded into the closure gate of the Apply PR. plan: ~/.claude/plans/c-1-pr-infra-drift-1-phase-1b-linear-frost.md backup: app/workers/classify_worker.py.pre-4b-eventkind-logging.20260517	2026-05-17 06:41:32 +00:00
hyungi	8998cbea8c	ops(triage): PR-4B-Diagnose: stronger exception logging (type/repr/exc_info) To diagnose the Layer 1 root cause, the exception logging at classify_worker.py:595 is strengthened with lazy formatting + exc_info=True. One f-string line → a 5-line block. - type=%s: exception class name (TimeoutError/JSONDecodeError/ValueError/etc.) - repr=%r: full exception state - exc_info=True: also captures the traceback (to trace the exact point in the wrapper) Scope of this PR = Diagnose only. The Layer 1 specific fix (H1/H2/H3/H4) + setting the ai_event_kind fallback on the Layer 2 escalate path are queued as separate PRs. plan: ~/.claude/plans/c-1-pr-infra-drift-1-phase-1b-linear-frost.md backup: app/workers/classify_worker.py.pre-4b-diagnose.20260517	2026-05-17 06:22:27 +00:00
hyungi	74876b674c	feat(auth): JWT iat + users.password_changed_at invalidation (PR-Docsrv-JWT-Invalidation-1) The PR-Infra-Sec-1H Phase 0 audit confirmed that DS has no JWT invalidation policy. Password rotation does not invalidate old 365d JWTs (voice-memo-bot etc.), which triggered a hard-gate STOP → split out as a prerequisite PR. - migration 269: users.password_changed_at timestamptz NULL (legacy compatible) - create_access_token / create_refresh_token: add iat (int seconds) to the payload - verify_password_changed_at helper: 401 when int(password_changed_at.timestamp()) > int(iat) - get_current_user + refresh_token route: call the verify helper - change_password / setup signup / seed_admin INSERT+UPDATE: update password_changed_at NULL = verification skipped (zero production impact right after the migration). iat verification becomes active only after the first password change. Secures the path to pass the G-token-old hard gate of Sec-1H.	2026-05-17 06:20:46 +00:00
Hyungi Ahn	a08b620894	refactor(search): swap 10 call sites to acquire_mlx_gate(Priority.) (B-1) DS-Mac-mini-26B-Priority-Gate-1: replaces `async with get_mlx_gate():` with `async with acquire_mlx_gate(Priority.):` at 10 sites (user-facing 7 + worker 3). Foreground 6 (user-facing path): - app/services/search/evidence_service.py:315 (/ask evidence stage) - app/services/search/classifier_service.py:103 (/ask classifier stage) - app/services/search/synthesis_service.py:299 (/ask synthesis stage) - app/api/documents.py:1306 (manual analyze API) - app/api/study_topics.py:1183 (synchronous subject note generation) - app/api/study_questions.py:1560 (synchronous study explanation API) Background 4 (worker queue / fire-and-forget): - app/services/search/query_analyzer.py:240 (V0 grep check: fire-and-forget only, called only from trigger_background_analysis at search_pipeline.py:179, consistent with the docstring rule "no synchronous analyze() calls" → confirmed BACKGROUND) - app/workers/deep_summary_worker.py:110 (classify-escalate worker) - app/workers/study_explanation_worker.py:149 - app/workers/study_session_analysis_worker.py:237 Cleanup: - removed query_analyzer._get_llm_semaphore(): self-only, unused, and its signature was misleading (get_mlx_gate now returns a context manager, not a Semaphore) The legacy get_mlx_gate() wrapper is kept (mapped to BACKGROUND). Zero leftovers on user-facing paths; the closure-gate grep check passed (in a separate commit).	2026-05-17 08:51:57 +09:00
Hyungi Ahn	7c9aff393a	feat(search): MLX priority gate (B-1, Priority.FOREGROUND vs BACKGROUND) DS-Mac-mini-26B-Priority-Gate-1: replaces the Mac mini 26B single-inference gate, a FIFO Semaphore, with priority-based heap dispatch. Concurrency stays at 1; only the queue ordering changes to favor foreground. API: - Priority(IntEnum): FOREGROUND=0, BACKGROUND=100 - acquire_mlx_gate(priority=DEFAULT_PRIORITY) async context manager - DEFAULT_PRIORITY = BACKGROUND (safe default, does not trample foreground) - get_mlx_gate() legacy wrapper, compatible as a context manager only Implementation: - _inflight: bool + _waiters heap [(priority, seq, future, enqueue_ts)] - fast-path: not inflight and not waiters → go inflight immediately, no Future created - _dispatch_next_locked: skips cancelled/done Futures (avoids the risk of stale heap entries) - release: pop inside the lock, set_result via loop.call_soon (outside the lock) to avoid reentry deadlock - dispatch / enqueue / release / WARN log (observability) - starvation WARN when BACKGROUND wait_ms > 300_000 (5 min); aging is deferred to Phase 2 Tests (tests/test_priority_gate.py, 6 scenarios): 1. FIFO within same priority 2. Foreground jumps queue (with 5 bg waiting, an incoming fg takes the very next slot) 3. Long-running background blocks foreground (no preemption, intended) 4. Mixed concurrent enqueue (FG in FIFO order first, then BG in FIFO order) 5. Backward compat (legacy get_mlx_gate() = mapped to BACKGROUND) 6. Cancelled waiter skip (dead Futures in the heap are skipped, the gate does not get stuck) Call-site replacement is in a separate commit (refactor(search): swap 10 call sites). plan: ~/.claude/plans/hermes-polymorphic-rossum.md	2026-05-17 08:42:58 +09:00
Hyungi Ahn	73f328cb65	fix(search): DS RAG LLM_TIMEOUT_MS align 15s/3s → 30s/10s (B-3 Synthesis-Timeout-Calibration-1) The PR-Hermes-Docsrv-Search-1 closure measurements (synthesis_ms=30~48s / ev_ms=15005 / query_analyze 45s) confirmed frequent timeouts at the 15s LLM_TIMEOUT. Under concurrent Mac mini 26B calls (even after serialization by the gate Semaphore 1, evidence + synthesis + classifier + query_analyzer + verifier accumulate sequentially), each call needs up to 30s. Changes in 5 places: - synthesis_service.LLM_TIMEOUT_MS 15000 → 30000 - evidence_service.LLM_TIMEOUT_MS 15000 → 30000 - verifier_service.LLM_TIMEOUT_MS 3000 → 10000 - query_analyzer.LLM_TIMEOUT_MS 15000 → 30000 - search.py:522 classifier wait_for 15.0 → 30.0 (classifier_service align) - search.py:641 verifier wait_for 4.0 → 10.0 (verifier_service align) Same policy as the classifier (already aligned to 30s in an earlier PR): align so that the outer wait_for does not override the inner LLM_TIMEOUT_MS. The higher upper bound on ask response latency is an intended trade-off: stability (avoiding refusal_gate conservative_refuse + normal grounding/verifier operation) comes first. Impact: zero regressions expected on the PR-1 fixture (the previous timeouts fall within the new limits). Serious latency reduction will be considered when the separate B-1 Throughput-1 PR (priority queue / model split) starts.	2026-05-17 08:01:22 +09:00
Hyungi Ahn	ad3d51e3e0	fix(search): move classifier + evidence inside the gate (ends the Mac mini 26B race) Permanent rule in the llm_gate.py docstring: "MLX primary call paths must acquire the gate, without exception". Since PR #20, both the classifier (new on Mac mini 26B) and evidence (triage → merged into Mac mini 26B) ran outside the gate, with concurrent safety marked for separate review. Result of 1 week of observation: frequent races. Layer 1 fixture measurements from this PR-Hermes-Docsrv-Search-1: - 8/10 queries hit "conservative_refuse(no_classifier)": under concurrent load the classifier almost always ends in ReadTimeout or the wait_for(6s) timeout - evidence ev_ms=15005: accumulates to 15s by racing with synthesis Impact: - total ask time increases (parallel race → serialized): query_analyzer 5s + classifier 3-5s + evidence 5s + synthesis 30s ≈ 40-45s upper bound (realistic average) - response rate ↑: conservative_refuse caused by race timeouts is resolved - user experience: quick refusal → meaningful answer, at the cost of a longer wait ↑ Follow-up: - skill `docsrv_ask` curl `--max-time 20` needs raising to 60s (separate PR or a follow-up within this PR) - decision outcome of the memory observation `2026-05-21 Mac mini 26B 1주 부하 측정` (1-week load measurement): return to the gate (the option of reintroducing a separate small model for triage is on hold)	2026-05-16 19:54:55 +09:00
Hyungi Ahn	5846baedc7	fix(search): ask classifier wait_for 6s → 15s (resolves the outer wrapper override) Follow-up diagnosis after A1 (LLM_TIMEOUT_MS 5→15→30) + config(10→15→30): 8/10 fixture queries take the conservative_refuse(no_classifier) path without any "classifier ok" or "classifier error" log. The outer wrapper `asyncio.wait_for(classifier_task, timeout=6.0)` at search.py:518 overrides both classifier_service.LLM_TIMEOUT_MS and the httpx timeout. 6s limit → under concurrent load almost no classifier call finishes within 6s → AsyncIO TimeoutError → ClassifierResult("timeout") → refusal_gate receives verdict=None and returns conservative_refuse. Raised to 15s. It is not aligned with the 30s inside classifier_service in order to keep the upper bound on ask response time (caps the extra wait at 9s after the parallel evidence finishes). Under concurrent Mac mini 26B load, measured elapsed often reaches 11-14s → 15s is a reasonable balance. This fix is the one with the real closure effect. Expected to resolve the 8/10 no_classifier path in the PR-Hermes-Docsrv-Search-1 Layer 1 fixture.	2026-05-16 19:46:49 +09:00
Hyungi Ahn	a332a8aabe	fix(search): classifier timeout 15s → 30s (concurrent load 2x margin) Follow-up diagnosis after A1+config(15s): a voice memo PoC plan call took elapsed_ms=14432, almost touching the 15s limit. Frequent ReadTimeouts remain under concurrent Mac mini 26B load (3-way: classifier + evidence + synthesis). Raised to 30s for a 2x margin, with config.yaml and classifier_service.py aligned. No effect on the Phase 3.5 guardrail behavior itself (the fallback path on timeout is the same). Future separate track (DS-Mac-mini-26B-Concurrent-Load-1): limit concurrent Mac mini 26B calls with asyncio.Semaphore vs reintroduce a small model for triage only. This PR only relaxes the timeout.	2026-05-16 19:42:49 +09:00
Hyungi Ahn	542b6a0084	fix(search): classifier error log type+repr (diagnosing empty-msg exceptions) The PR-Hermes-Docsrv-Search-1 Layer 1 fixture frequently reports classifier error: <empty message>. Direct calls in isolation succeed 3/3; the error occurs only under concurrent load (classifier + evidence in parallel in the ask endpoint). Captures the exception type + repr to identify the root cause (httpx.ReadTimeout / TimeoutError / ConnectionError / something else). Once identified, proper mitigation follows in a later PR (DS-Classifier-Concurrent-Load-1).	2026-05-16 19:08:23 +09:00
Hyungi Ahn	c769ad14ad	fix(search): classifier LLM_TIMEOUT_MS 5s → 15s (Mac mini 26B concurrent load) After PR #20 (`f139945`) removed the GPU LLM, Mac mini 26B absorbs triage + classifier + chat + STT concurrently. The hardcoded 5s timeout in classifier_service (which ignores config.yaml `timeout: 10`) is frequently exceeded under concurrent load → accumulates to CIRCUIT_THRESHOLD(5) → circuit open for 60s → verdict=None → refusal_gate conservative_refuse(no_classifier) path. Measured: a standalone call under normal load = 2.3s (500 prompt + 49 completion tokens); under concurrent calls ev_ms/synth_ms accumulate up to 15s, so the 5s limit is an architectural mismatch. Raised to 15s → the classifier returns a normal verdict → refusal_gate uses the classifier's sufficient/insufficient (avoiding the conservative fallback). This fix falls out naturally as a regression finding from the [[2026-05-21 Mac mini 26B 1주 부하 측정]] observation. It is a separate variable from config.yaml `classifier.timeout: 10`: this one line is the in-code limit, and the config item will be reviewed for unification in a separate PR (Config-Driven-Timeout-1). How it was found: the PR-Hermes-Docsrv-Search-1 Layer 1 fixture (curl direct, 10/10 ask) reported 8 conservative_refuse(no_classifier) + 2 timeouts. The fastapi log showed the pair "classifier circuit OPEN for 60s" + "classifier timeout".	2026-05-16 19:02:55 +09:00
Hyungi Ahn	19bf5b1e38	feat(memo): Hermes input gateway: source_channel='hermes' + source_metadata jsonb PR-Hermes-Docsrv-Bridge-1 v1. Reframes the Hermes Agent (Mac mini Discord) as an input gateway to Document Server: not a coding executor, zero change to Claude Code. Changes: - migration 267: add 'hermes' to the source_channel enum - migration 268: add documents.source_metadata jsonb NOT NULL DEFAULT '{}' - Document model: ORM mapping for the source_metadata column + expose enum 'hermes' - MemoCreate: accepts source_channel + source_metadata fields (default='memo' for compatibility) - create_memo: channel allowlist (memo/voice/hermes) + stores metadata as jsonb - list_memos: add 'hermes' to the IN tuple (shown in the inbox) - MemoResponse + _to_memo_response: expose source_metadata (preparation for a UI badge) 0 LLM calls: only the HTTP POST from Hermes. Classification/summarization is handled asynchronously by classify_worker. Zero changes to the promote-to-event guard (562/664): in v1, promotion of hermes memos stays blocked. plan: ~/.claude/plans/idempotent-seeking-hollerith.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 13:44:15 +09:00
Hyungi Ahn	3627060d2a	fix(ingest): devonagent extract md_status 'ready' → 'success' The documents_md_status_check constraint only allows {pending/processing/success/partial/failed/skipped}. The web HTML branch of extract_worker wrote 'ready', which failed 3 times with CheckViolationError. Corrects the places in plan/docs/memory where the value was wrongly written as 'ready'. Found while verifying 19668 (the first sample doc). After the fix, the job was re-run by resetting the 'failed' queue rows. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 08:42:15 +09:00
Hyungi Ahn	0cbba0ceeb	feat(ingest): enable Phase 1 ingest for the devonagent track Web pages discovered by DEVONagent/DEVONthink are dropped into NAS Web/ → file_watcher ingest → extract with 4-tier fallback (trafilatura/sibling-md/readability/bs4) → through embed + chunk. classify/preview/markdown are SKIPPED. - source_channel='devonagent' (activates the value left dormant since migration 001) - file_watcher: unified SCAN_TARGETS + Web/ rglob + canonical_url dedup + missing-sidecar policy (not skipped; flagged with web_meta.sidecar_missing=true) - extract_worker: HTML+devonagent branch + 4-tier md_extraction_engine distinction (trafilatura → sibling .md ≥200char → readability+markdownify → bs4_text) - queue_consumer: source_channel-aware override only for the extract stage of enqueue_next_stage (devonagent → [embed, chunk]) - classify_worker: devonagent safety skip (mirrors the law_monitor pattern, ai_domain='Web', ai_tags=['Web/{host}']) - requirements: add trafilatura/readability-lxml/markdownify - docs: devonthink-web-bridge.md install guide + first-wins policy stated explicitly Phase 1 closure criterion = material quality (searchable + noise rate + dedup + engine distribution). Consumers (ai_tldr/digest/PKM retrospective) will be decided in a separate PR after 1-2 weeks OR 30-50 documents of observation. Plan: ~/.claude/plans/db-snuggly-petal.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 21:23:16 +09:00
hyungi	118f32f9b1	refactor(ai): PR #20 reframe cleanup: correct leftover Ollama LLM comments Cleans up leftover wording where the swap in PR #20 (2026-05-14, GPU LLM removed + absorbed into Mac mini 26B MLX) did not carry through to backends.json and code comments/docstrings. - app/ai/client.py: in the AIClient docstring and the call_triage / call_fallback docstrings, "4B Ollama" → "Mac mini 26B MLX" / "currently the same endpoint as triage" → "Claude Sonnet 4 API (PR #20 swap complete)" - app/core/config.py: unified the triage/primary/fallback comments + noted the PR #20 endpoint in the Phase 3.5 classifier/verifier comments (history preserved) - app/services/search/{llm_gate,classifier_service,verifier_service, evidence_service}.py: corrected "fallback(Ollama)" / "Ollama concurrent OK" / "triage(4B Ollama)" wording to refer to the Mac mini 26B MLX endpoint + added a marker for separate review of concurrent safety - app/services/digest/summarizer.py: "MLX hang/Ollama stall defense" → "MLX hang / fallback Claude API stall defense" - app/services/prompt_versions.py: "4B Ollama" / "4B gemma Ollama" in the SUMMARY_TRIAGE_TASK + ASK_PROMPT_VERSION comments → Mac mini 26B MLX - app/workers/classify_worker.py: corrected the B-1 tier triage docstring Zero changes to code behavior (comments/docstrings only). The "Ollama bge-m3" wording in embed_worker / study_question_embed_worker is factually correct and is kept. Verification: - ollama list → bge-m3:latest still present (embedding owner) - /api/embeddings probe → 1024-dim 200 OK - 0 fastapi embed/ollama errors (last 10min) - document.hyungi.net 200 plan: ~/.claude/plans/4-stateless-dongarra.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 12:09:15 +00:00
Hyungi Ahn	08cf676c26	fix(news): add chunk stage enqueue for news documents + 7-day backfill script Root cause of document_chunks.country being 99.9% NULL over the 7-day distribution: news_collector enqueued only summarize + embed and never chunk, so chunk_worker had never run on news documents. The missing summarize key in queue_consumer.next_stages is why the follow-up stage was never linked. To avoid side effects on non-news summarize flows, the fix does not touch next_stages; instead one explicit chunk enqueue line is added to each of the RSS and API paths of news_collector. It sits inside the days_old <= 30 guard, with the same policy as embed. scripts/news_chunk_country_backfill.py: small batches per doc, skips failed docs, prints progress every 50 docs. Bypasses the queue and calls chunk_worker.process directly to control timing. Gate (PR closure): A) chunked_doc_pct > 95%: share of news docs from the last 7 days that have chunks B) country null_pct < 5%: share of news chunks from the last 7 days with NULL country plan: ~/.claude/plans/7-whimsical-crab.md (PR-News-Prep-Layer-1) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 16:35:53 +09:00
hyungi	5125f82d4a	feat(study): Mac mini derived-worker (PR-MacMini-Derived-Worker-1) GPU = RAG context provider, Mac mini = LLM processing factory. GPU-side changes: - app/api/internal_study.py: GET /internal/study/explanation-context/{qid} Bearer auth, re-invokes gather_explanation_context + _render_envelope_prompt. 204=evidence missing, 410=deleted/ready. - app/workers/study_queue_consumer.py: when settings.study_explanation_enabled is false, the explanation branch is skipped (status/attempts unchanged, stays pending → absorbed by the Mac mini). - app/core/config.py: 2 settings, study_explanation_enabled + internal_worker_token. - app/main.py: include internal_study_router (prefix /internal/study). - docker-compose.yml: fastapi ports → 100.110.63.63:8000 Tailscale bind, added STUDY_EXPLANATION_ENABLED + INTERNAL_WORKER_TOKEN env. Mac mini side: ~/derived-worker/ (not pushed separately, written yesterday). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 03:13:43 +00:00
Hyungi Ahn	261036c7b2	ops(file-watcher): make idle fires visible in the log watch_inbox() was silent when both new_count and changed_count were 0. The PR-NAS-Watch-Folder verification showed there was no way to trace fires, so this fills the gap: an else branch logs one info line, "변경 없음 (idle)" ("no changes (idle)"), on every 5min fire. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 13:32:38 +09:00
Hyungi Ahn	52f86acda7	feat(auth): voice-memo bot 365d access token (PoC v1) Adds a long-expiry access token issuance path limited to the bot account (`voice-memo-bot`). Zero impact on the regular user flow (env gate defaults to false). - core/auth.py: new create_voice_memo_bot_token() (env gate + username hard-match) - api/auth.py: bot branch in the login route (returns the long token for the bot, the existing flow otherwise) - docker-compose.yml: 3 env vars (VOICE_MEMO_BOT_TOKEN_ENABLED/_USERNAME/_EXPIRE_DAYS), default false This is the auth basis for the OpenClaw `/voice-memo` plugin → DS `/memos/` Bearer proxy. A proper service-account/api_keys table is Phase 2 (when multi-service ingestion is added). plan: ~/.claude/plans/rosy-launching-otter.md project: ~/.claude/projects/-Users-hyungiahn/memory/project_voice_memo_pipeline.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 12:24:18 +09:00
hyungi	1293c7094a	Merge pull request 'feat/news-tech-ai-sources' (#17 ) from feat/news-tech-ai-sources into main Reviewed-on: #17	2026-05-13 07:54:59 +09:00
hyungi	4b8120d83f	feat(briefing): date picker + per-card read/highlight actions User request (2026-05-13): - only today's briefing is shown and past ones cannot be viewed → date selection UI - listing by time slot is actually less convenient → single-step date dropdown - read/highlight toggles on each card Schema (migrations 263~266, single statement each): - briefing_topics.is_read BOOL NOT NULL DEFAULT false - briefing_topics.read_at TIMESTAMPTZ - briefing_topics.highlighted BOOL NOT NULL DEFAULT false - briefing_topics.highlighted_at TIMESTAMPTZ API (app/api/briefing.py): - add id / is_read / read_at / highlighted / highlighted_at to TopicResponse - GET /api/briefing/dates → list of available dates (60-day cap) · briefing_date / total_topics / total_articles / status / read_count / highlighted_count - PATCH /api/briefing/topics/{id}/read body {value: bool} → read toggle - PATCH /api/briefing/topics/{id}/highlight body {value: bool} → highlight toggle - the *_at column is set or reset to NULL automatically on toggle UI (frontend/src/routes/news/+page.svelte): - <select> date dropdown at the right of the header: latest + N days (highlighted_count marked with a star) - on selection, loads that date's briefing via /api/briefing?date=… - ★ (highlight) + read buttons at the top right of each card - highlight = Card class ring-2 ring-yellow-400 - read = outer div class opacity-60 (visually subdued, still expandable) - a toggle calls PATCH immediately + updates local state each key changed from topic.topic_rank to topic.id (already unique).	2026-05-12 22:05:06 +00:00
hyungi	1d3d61d31e	fix(briefing): lower clustering threshold 0.78 → 0.70 Observed after deployment (early morning 2026-05-13): - 126 docs / 7 countries, yet THRESHOLD=0.78 gave raw_clusters=124, dropped_min_articles=122, kept=1. - almost every article ended up in its own cluster, so topic grouping failed. - yesterday (5/12) the same cron produced 6 topics from 101 docs; that day's news happened to concentrate on the same topics. Manual measurement (same 5/13 docs): - 0.78 → kept=1 - 0.70 → kept=5 (allowed) Permanent change = THRESHOLD=0.70. The cross-country filter (MIN_COUNTRIES≥2) + min_articles(≥2) stay as they are, so the risk of noise topics is limited. The original comment (midpoint of 0.75~0.80) is updated as well.	2026-05-12 21:44:00 +00:00
hyungi	2dbbeac1c7	fix(daily_digest): cast today to date object for KST comparison The daily 20:00 KST cron fire failed with: UndefinedFunctionError: operator does not exist: date = character varying Cause: today was a string produced by strftime("%Y-%m-%d"), while func.date(created_at) has type date. PostgreSQL rejects a date = string comparison. Fix: today = datetime.now(ZoneInfo("Asia/Seoul")).date(), i.e. a date object. The KST basis matches naturally because the scheduler cron fires at 20:00 KST. scope: app/workers/daily_digest.py:24	2026-05-12 21:30:41 +00:00
hyungi	138f689c98	fix(scheduler): pass KST timezone to all CronTriggers Bug: the scheduler-level timezone of AsyncIOScheduler(timezone="Asia/Seoul") was not propagated automatically to CronTrigger, so all 6 crons fired in UTC. Impact (all off by 9h): - morning_briefing intended 05:10 KST → actual 14:10 KST - daily_digest intended 20:00 KST → actual 05:00 KST (next day) - global_digest intended 04:00 KST → actual 13:00 KST - law_monitor intended 07:00 KST → actual 16:00 KST - mailplus_morning intended 07:00 KST → actual 16:00 KST - mailplus_evening intended 18:00 KST → actual 03:00 KST (next day) Fix: pass timezone=KST (= ZoneInfo("Asia/Seoul")) explicitly to every CronTrigger. Verification (after restart): law_monitor next: 2026-05-13 07:00 KST mailplus_morning next: 2026-05-13 07:00 KST mailplus_evening next: 2026-05-13 18:00 KST daily_digest next: 2026-05-13 20:00 KST global_digest next: 2026-05-14 04:00 KST morning_briefing next: 2026-05-14 05:10 KST	2026-05-12 21:30:34 +00:00
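The 9-hour shifts listed in 138f689c98 can be reproduced with a small standard-library sketch. It does not use APScheduler at all; it only assumes that each cron's hour/minute were evaluated in UTC instead of KST. The names `INTENDED`, `actual_kst_fire`, and `shift_report` are hypothetical, and a fixed UTC+9 offset stands in for `ZoneInfo("Asia/Seoul")`.

```python
from datetime import datetime, timedelta, timezone

# Fixed-offset stand-in for ZoneInfo("Asia/Seoul"); KST has no DST, so UTC+9 is exact.
KST = timezone(timedelta(hours=9), "KST")

# Intended KST fire times, taken from the commit message.
INTENDED = {
    "morning_briefing": (5, 10),
    "daily_digest": (20, 0),
    "global_digest": (4, 0),
    "law_monitor": (7, 0),
    "mailplus_morning": (7, 0),
    "mailplus_evening": (18, 0),
}


def actual_kst_fire(hour: int, minute: int) -> datetime:
    """KST wall-clock time when hour/minute are (wrongly) evaluated in UTC."""
    fired_utc = datetime(2026, 5, 13, hour, minute, tzinfo=timezone.utc)
    return fired_utc.astimezone(KST)


def shift_report() -> dict[str, str]:
    """Map each job name to the KST time at which it actually fired."""
    report = {}
    for job, (hour, minute) in INTENDED.items():
        local = actual_kst_fire(hour, minute)
        next_day = " (next day)" if local.day != 13 else ""
        report[job] = f"{local:%H:%M} KST{next_day}"
    return report


if __name__ == "__main__":
    for job, when in shift_report().items():
        print(f"{job}: {when}")
```

Each computed value matches the "actual" column of the commit message, which supports the diagnosis that the triggers were running on a UTC clock.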
Hyungi Ahn	55e39818ec	feat(briefing): register 05:10 KST APScheduler cron Runs morning_briefing_run automatically every day at 05:10 KST. The scheduler timezone is Asia/Seoul, so only hour=5 minute=10 is specified. Leaves a 70-minute buffer after the Phase 4 04:00 cron ends and avoids MLX semaphore contention.	2026-05-12 14:54:20 +09:00
Hyungi Ahn	6966be9cf6	fix(briefing): backfill country_perspectives[].article_ids from cluster members Server-side correction for cases where the LLM leaves article_ids empty on its own (in the first briefing on 2026-05-12, all 6 topics had empty lists). Post-processing policy (_resolve_article_ids): 1. ids given by the LLM ∩ cluster member ids (blocks unrelated ids, hallucination defense) 2. if empty, automatically inject the top-weight N cluster members of the same country 3. zero country-matching members in the cluster → [] per-country cap = MAX_ARTICLE_IDS_PER_COUNTRY = 5, in descending weight order. Stronger API contract: a topic with country_perspectives is guaranteed article_ids ≥ 1 (when a same-country cluster member exists). The frontend / external channels / archive UI can all rely on it. Adds 3 test cases.	2026-05-12 13:15:26 +09:00
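The three-step backfill policy in 6966be9cf6 can be sketched as follows. This is a minimal illustration, not the repository's `_resolve_article_ids`. The function name `resolve_article_ids`, the member dict keys (`id`, `country`, `weight`), and the choice to apply the cap to LLM-supplied ids as well are all assumptions.

```python
MAX_ARTICLE_IDS_PER_COUNTRY = 5  # cap value stated in the commit message


def resolve_article_ids(llm_ids, country, cluster_members,
                        cap=MAX_ARTICLE_IDS_PER_COUNTRY):
    """Return trusted article ids for one country perspective.

    cluster_members: list of dicts with hypothetical keys
    "id", "country", "weight".
    """
    member_ids = {m["id"] for m in cluster_members}

    # Step 1: keep only LLM ids that really belong to the cluster
    # (drops hallucinated ids); dedupe while preserving order.
    kept = list(dict.fromkeys(i for i in llm_ids if i in member_ids))
    if kept:
        return kept[:cap]  # applying the cap here is an assumption

    # Step 2: nothing usable from the LLM, so inject the heaviest
    # cluster members of the same country.
    same_country = sorted(
        (m for m in cluster_members if m["country"] == country),
        key=lambda m: m["weight"],
        reverse=True,
    )

    # Step 3: no same-country member yields an empty list.
    return [m["id"] for m in same_country[:cap]]


if __name__ == "__main__":
    members = [
        {"id": 1, "country": "KR", "weight": 0.9},
        {"id": 2, "country": "KR", "weight": 0.5},
        {"id": 3, "country": "US", "weight": 0.7},
    ]
    print(resolve_article_ids([99, 2], "KR", members))
    print(resolve_article_ids([], "KR", members))
    print(resolve_article_ids([], "JP", members))
```

Filtering before falling back means a partially correct LLM answer is preferred over the automatic injection, which matches the ordering of steps 1 and 2 in the commit message.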
Hyungi Ahn	431d4fe010	feat(briefing): add morning briefing schema + services + api (historical off) A one-page card with topic×country comparative analysis of news collected overnight (KST 00:00~05:00). Code, logic, and tables are separate from the Phase 4 Global Digest; only the algorithm is shared via services/clustering_common. New backend: - migrations/255_morning_briefings.sql: morning_briefings + briefing_topics (briefing_date UNIQUE, UNIQUE(briefing_id,topic_rank), FK CASCADE, 3 nullable historical_* columns, cluster_members JSONB, country_perspectives JSONB, 4-state status success|partial|failed|empty) - app/models/briefing.py: SQLAlchemy ORM - app/services/briefing/loader.py: KST 5h window + news_sources prefix fallback (mirrors the Phase 4 pattern) + historical candidate pool loader - app/services/briefing/clustering.py: cluster_global topic-first (LAMBDA=ln(2)/2h, MIN_COUNTRIES_PER_TOPIC=2, MAX_TOPICS=7) - app/services/briefing/comparator.py: call_primary 26B + JSON envelope sanitize (cap perspectives 10 / divergences 3 / convergences 2 / quotes 5) + fixed-shape fallback row + retrieve_historical cosine top-K - app/services/briefing/pipeline.py: load→cluster→select(K=7,λ=0.6) →historical→compare→status 4-state→delete+insert transaction - app/workers/briefing_worker.py: shared entry point for APScheduler and manual calls, 600s hard cap - app/prompts/briefing_comparative.txt: Korean-language comparative-analysis JSON prompt, 2 sections {articles_block} + {historical_block}, do-not-quote label - app/api/briefing.py: GET /latest, GET ?date=, POST /regenerate?date= (admin, sync delete+insert tx, regenerated:true) Backend changes: - app/main.py: register briefing_router (/api/briefing prefix). Scheduler registration comes in PR-3. - app/services/digest/selection.py: parameterize select_for_llm (K, λ injected by the caller). Phase 4 behavior is preserved through default values. Historical policy: - BRIEFING_HISTORICAL_ENABLED env flag, default off. - flag off → all historical_* columns NULL, prompt {historical_block} is an empty label, retrieval is not called. - flag on (enabled in PR-1b) → cosine top-K 5 (sim≥0.70) between the cluster centroid and doc embeddings from the past 30 days, injected into the prompt. 
Country canonical (after checking real data): - documents.country column confirmed absent - document_chunks.country match rate 0% (chunks are not created for news at all) - the only country signal = news_sources prefix mapping (same as Phase 4) Tests: - tests/test_briefing_historical.py: regression over 3 paths (flag off / on with fixture / on zero match) + sanitize cap + fallback row shape. Verification: GPU-container pytest + manual regenerate in PR-1.8.	2026-05-12 12:58:50 +09:00
Hyungi Ahn	1ca6d8b522	refactor(digest): extract clustering helpers to clustering_common Extracts the core clustering algorithms of the Phase 4 Global Digest (time-decay weight, adaptive threshold, greedy cosine assign + EMA centroid, importance normalize) into `app/services/clustering_common.py`. The country axis is the caller's responsibility: Phase 4 cluster_country keeps calling per country as before, and the new morning briefing module will call cluster_global without country. The duplicate _normalize in selection.py is also unified into the common util. No behavior change: - LAMBDA / threshold / EMA alpha / MIN_ARTICLES all keep the Phase 4 defaults - docs.sort (in-place) was changed to sorted (copy), but this is harmless because the caller does not reuse the sorted docs (weights assigned to dict elements are by reference, so they are still reflected) Phase 4 regression verification (digest regenerate diff 0) follows in the next commit.	2026-05-12 12:38:32 +09:00
Hyungi Ahn	3dc78e4f94	fix(memos): voice memo file_type → 'immutable' (doc_type enum compatibility) After pulling main on the GPU server, /api/memos/?archived=false returned 500: the doc_type enum has no 'audio' value (only immutable/editable/note). The list_memos clause WHERE file_type IN ('note', 'audio') raised invalid_text_representation. Changes: - voice upload Document.file_type = 'audio' → 'immutable' (same pattern as the existing audio container ingestion: file_type='immutable' + category='audio' + source_channel='voice') - remove the file_type condition from the list_memos filter (separate by source_channel IN ('memo','voice') alone; a file_type='immutable' filter would also pull in ordinary PDFs, which is risky) - update the module docstring + voice upload comments The original plan's file_type='audio' decision came from not checking the doc_type enum. Instead of extending the enum (ALTER TYPE ADD VALUE 'audio'), the existing pattern is reused: safe, with no regression.	2026-05-11 12:28:58 +09:00
Hyungi Ahn	6490050b04	feat(memos): promote memo to event + voice memo upload endpoint PR-2B/2C backend 2/2. Merges commits 2~3 of the plan v9 commit split (single-file change in memos.py). PR-2B promote-to-event: - POST /api/memos/{memo_id}/promote-to-event: 1-click promotion of a memo to events · kind resolution: body.kind > documents.ai_event_kind > 400 · for activity_log, status=done + ended_at=now() automatically (5-second activity logging UX) · for calendar_event with start_at, status=scheduled · Event row + events_history(create) created automatically · memo_document_id linked automatically + source='memo' + AI-suggested values preserved in raw_metadata · one memo → N events allowed (no dedup, per user intent) - POST /api/memos/{memo_id}/dismiss-event-suggestion: 'just a memo' (forces ai_event_kind='note') · MVP: AI-suggested and user-confirmed values share one column (may blur accuracy measurement) · backlog: split out a separate user_event_kind column (plan Memo Intake Upgrade backlog) - MemoResponse extended: ai_event_kind / ai_event_confidence / source_channel / file_type / file_path - list_memos filter relaxed: file_type IN (note, audio) + source_channel IN (memo, voice) → voice memos appear in the same inbox list (user intent: memo = inbox for all input) PR-2C voice upload: - migration 254: ALTER TYPE source_channel ADD VALUE 'voice' - POST /api/memos/voice (multipart audio + recorded_at + device_hint) · validation: Content-Type audio/* + size ≤ 50MB + extension whitelist · NAS storage: /documents/PKM/Recordings/{YYYY-MM}/{uuid}.{ext} · fsync + rename (atomic) pattern (safe on NAS soft mounts) · Document row: file_type='audio' + source_channel='voice' + category='audio' · enqueue to the stt queue → existing stt_worker → classify (PR-2B triage) → embed → chunk · device_hint / recorded_at preserved in extract_meta - Response: MemoResponse (includes file_path, for the frontend audio player) Principle: the AI worker never creates events rows directly. This endpoint is the channel for user intent.	2026-05-11 12:06:41 +09:00
Hyungi Ahn	63990ac632	feat(memos): add AI event-kind triage fields PR-2B (Memo Inbox Triage) backend 1/2. plan: beszel-tingly-sloth.md round 13. User vision = memos are the inbox, AI is the triage assistant. The AI worker never creates events rows directly. Migrations 250–253 (measured N=250): - 250 CREATE TYPE event_kind_hint AS ENUM (note|task|calendar_event|activity_log|reference) - 251 ALTER TABLE documents ADD ai_event_kind event_kind_hint - 252 ALTER TABLE documents ADD ai_event_confidence NUMERIC(3,2) + CHECK 0–1 - 253 CREATE INDEX idx_documents_ai_event_kind partial WHERE ai_event_kind IS NOT NULL ORM: - add Document.ai_event_kind / ai_event_confidence columns (synced with the SQLAlchemy Enum) - add 'voice' to the source_channel enum (compatible with PR-2C) Worker: - extend classify_worker Phase 3 (Gemma 4B triage) · add event_kind_hint + event_kind_confidence fields to TriageOutput · save to Document only when the 4B response contains a hint (values outside the enum are ignored) - extend prompt p3a_short_summary.txt: classification criteria for note/task/calendar_event/activity_log/reference + confidence + explicit default='note' Principle: the AI worker only provides a hint. Events are created only in the promote endpoint of the next commit.	2026-05-11 12:04:21 +09:00
hyungi	a842dc682e	Merge pull request 'wip/gpu-main-snapshot-2026-05-11' (#7) from wip/gpu-main-snapshot-2026-05-11 into main Reviewed-on: #7	2026-05-11 08:11:44 +09:00
Hyungi Ahn	6d71116553	feat(events): PR-2 UI MVP - 4-tab + quick activity logging + detail/create/history plan v6 PR-2 scope. The 5-second activity logging UX is the core hypothesis. Backend: - GET /api/events/{id}/history: events_history timeline query (lifecycle ops recorded automatically) Frontend (SvelteKit 5 runes mode): - /events main: 4 tabs (Today/Inbox/Upcoming/Activity) + quick activity logging widget · single input + Enter → POST /api/events kind=activity_log · status=done + default times filled in (server side) → reflected in the Activity tab immediately · new item prepended to the top of the list (no refetch needed) · input ref keeps focus for consecutive entries · lifecycle buttons (complete/defer/cancel/reactivate); activity_log is not subject to lifecycle - /events/[id] detail: edit of PATCH-allowed fields (title/desc/times/priority/project_tag) + history timeline · PATCH-forbidden fields are not exposed in the UI (status/completed_at/cancelled_at/defer_until use separate buttons) - /events/new: choose kind (task/calendar_event/activity_log), then a form whose fields branch by kind · task: due_at + start_at (optional; allows timed tasks such as "14:00 phone call", round 10) · calendar_event: start_at required + end_at + all_day · activity_log: if started_at/ended_at are left empty, the server defaults to now() - events entry point next to Memos in the Sidebar (CalendarCheck icon) API helpers: frontend/src/lib/utils/events.ts (createEvent / logActivity / list* / lifecycle ops / kind&status enum label/color). quickref doc: docs/events_api_quickref.md (previous commit, PR-2 frontend reference). PR-2 core hypothesis check = quick input → save → immediate reflection in Activity → persists across reload. The 5 deferred PR-1 HTTP behavior items are also closed through natural use of this UI.	2026-05-11 07:56:31 +09:00
Hyungi Ahn	9d9b3359b0	feat(events): PR-1 Events Core - schema + ORM + minimal API Introduces a new primary container domain for personal operations logs / schedules / to-dos / retrospectives. plan: ~/.claude/plans/beszel-tingly-sloth.md (round 12 v6). Schema: - 5 enums (event_kind / event_status / event_source / event_actor / history_change_kind) - events table: kind(task|calendar_event|activity_log) + 7-state lifecycle status - events_history: lifecycle ops recorded automatically, FK RESTRICT (history is a point-in-time fact) - CHECK: calendar_event → start_at NOT NULL / activity_log → started_at|ended_at NOT NULL - partial unique (source, source_ref): external-source dedup (used in PR-4) - partial index (active status / activity_log timeline) API: - POST /api/events (kind=activity_log shortcut: status=done + ended_at=now() default) - GET /api/events/{id} | /api/events?kind&status&from&to&project_tag&source - PATCH /api/events/{id} (extra=forbid + reschedule history when time fields change) - POST /api/events/{id}/{complete,cancel,defer,reactivate} (history automatic) - GET /api/events/today (Asia/Seoul default, deferred items only when defer_until<=now()) - GET /api/events/inbox | /api/events/activity?from&to Excluded (PR-2~5 or backlog): - DELETE (retrospective data → unified on /cancel) - log shortcut / upcoming endpoint (absorbed by POST + GET ?from&to) - /ingest (to be added with exact requirements at PR-4 MailPlus forward) - iCal export / ntfy notifications / recurrence / general edit history	2026-05-11 07:19:04 +09:00
Hyungi Ahn	aca2f0d62c	feat(canonical): restore GPU STT owner and extend KGS watch paths D9 Track B revised (2026-05-08): 1) STT ownership formally returned to the GPU server: - docker-compose.yml: remove stt-service profiles:[legacy] → always active - fastapi STT_ENDPOINT = http://stt-service:3300 (compose internal DNS) - policy: the Mac mini is dedicated first to Gemma 26B, so STT/Whisper is owned by the GPU server regardless of call volume. The earlier "migrated to Mac mini" comment was based on a misread trace. 2) Additional scan paths for external study materials such as KGS Code: - ADDITIONAL_WATCH_TARGETS env (comma-separated, paths relative to PKM) - app/core/config.py: add additional_watch_targets list setting - app/workers/file_watcher.py: handle additional watch paths - app/workers/classify_worker.py: KGS Code classification branch (Gas Engineer certification study materials) - all handled as expected_category=library (md/pdf/docx only) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 05:47:20 +00:00
Hyungi Ahn	8ca27eb573	fix(markdown): img auth via ?token= query param (Authorization header not supported) `<img src=>` cannot send an Authorization header, so /api/documents/{id}/images/{key}/raw returned 401 and images did not show. The access token is now passed as a query parameter, same as the existing /file?token= iframe pattern. backend: drop the get_current_user dependency and validate the token query parameter directly (same flow as the existing /file endpoint). frontend: MarkdownDoc's swap selector appends ?token={getAccessToken()} to img.src. When logged out, the placeholder is kept. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 14:47:09 +09:00
Hyungi Ahn	68fa86ea52	feat(markdown): persist extracted images with auth routes Markdown Canonical Phase 1B.5: persists the images that marker was extracting to the NAS and wires up DB metadata + auth route + frontend swap. Key changes: - marker-service /convert response includes a list of base64 images (stays stateless, no NAS write permission) - marker_worker persists to NAS `/documents/extracted_images/{doc_id}/` + UPSERT + DELETE of orphan rows + normalizes md_content refs to the stable `docimg:img_NNN` scheme - /api/documents/{id}/images/{key}/raw authenticated route (Cache-Control private + ETag = content_hash) - frontend MarkdownDoc swaps docimg refs inside placeholder cards for real <img> elements Principles: - image binary = NAS, metadata = Postgres (same as the study-section pattern) - image_key is sequence-based and deterministic → re-conversion is idempotent - rollback possible via MARKDOWN_IMAGE_PERSIST=false env (placeholder card fallback remains naturally) The existing 28 marker-success documents are not touched in this PR: after deploy + 1 new upload + 5-sample verification, they get a targeted reprocess via scripts/marker_reprocess_existing_success.py. plan: ~/.claude/plans/piped-humming-crystal.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 14:05:41 +09:00
Hyungi Ahn	5b62c59f8a	fix(canonical): classify marker_worker transport-layer errors as transient retry Before: only (ConnectError, TimeoutException) were transient → raise → queue retry. Other transport errors such as ReadError / WriteError / RemoteProtocolError were caught by 'except Exception' and handled by _fail → final fail, ignoring max_attempts. In the Phase 1D pilot, two docs (5111/5115) final-failed without retry on 'Server disconnected without sending a response' (RemoteProtocolError). Fix: except (ConnectError, TimeoutException) → except TransportError. TransportError is the common parent of Connect/Read/Write/RemoteProtocol/Timeout, so every transport-layer error becomes eligible for transient queue retry. The ReadTimeout on 5135 (queue exhausted) is unrelated to this fix: an 8.4MB PDF could not finish within MARKER_TIMEOUT=300s and timed out on all 3 retries. A separate decision is needed on raising the timeout itself or splitting large PDFs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 15:29:47 +09:00
Hyungi Ahn	5185501bbd	feat(search): PR-RAG-Time-1 freshness decay (news/law_monitor) Applies a time-weight soft multiplier to news / law-alert retrieval results. It runs in the final score composition step after the reranker, separated out as an operational policy step. - news (source_channel='news'): half-life 90 days - law_monitor (source_channel='law_monitor'): half-life 365 days - not applied: manual / drive_sync / inbox_route / memo / Manual / Reference / Academic_Paper / Checklist / KGS Code / Study / content_origin='ai_drafted' - formula: decay = exp(-ln(2) * age / HL); final = base * (0.7 + 0.3 * decay) - floor 0.7 (full demotion forbidden) - guards: missing date / future date / unknown source are all no-ops - temporary date source: documents.created_at (no published_date column yet; follow-up PR) debug metadata (?debug=true response + logs/search.log): base_score / age_days / decay_factor / freshness_adjusted_score / freshness_policy / freshness_date_source new: app/services/search/freshness_decay.py hook: app/services/search/search_pipeline.py:303 (right after apply_diversity, right before normalize) schema: app/api/search.py SearchResult.freshness_debug (Optional[dict]) tests: tests/test_freshness_decay.py 24 cases (policy dispatcher 9 + age/decay/score 11 + apply integration 6, guards 1~6 all) The Episode/Fact layer and contradiction detection are outside this PR's scope. plan: ~/.claude/plans/pr-rag-time-1-freshness-decay.md	2026-05-03 08:38:09 +09:00
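The formula and guards in 5185501bbd can be illustrated with a short sketch. It uses the half-lives and the 0.7 floor stated in the commit message; the function name `freshness_adjusted`, its signature, and the `HALF_LIFE_DAYS` table are hypothetical and are not taken from `freshness_decay.py`.

```python
import math
from datetime import datetime, timedelta, timezone

# Half-lives in days per source_channel, as stated in the commit message.
HALF_LIFE_DAYS = {"news": 90.0, "law_monitor": 365.0}
FLOOR = 0.7  # a fully decayed document keeps 70% of its base score


def freshness_adjusted(base_score, source_channel, doc_date, now):
    """final = base * (0.7 + 0.3 * decay), decay = exp(-ln(2) * age / HL)."""
    half_life = HALF_LIFE_DAYS.get(source_channel)
    if half_life is None or doc_date is None:
        return base_score  # guard: unknown source or missing date is a no-op
    age_days = (now - doc_date).total_seconds() / 86400.0
    if age_days < 0:
        return base_score  # guard: future date is a no-op
    decay = math.exp(-math.log(2) * age_days / half_life)
    return base_score * (FLOOR + (1.0 - FLOOR) * decay)


if __name__ == "__main__":
    now = datetime(2026, 5, 3, tzinfo=timezone.utc)
    one_half_life_ago = now - timedelta(days=90)
    # One half-life: decay = 0.5, so the multiplier is 0.7 + 0.3 * 0.5 = 0.85.
    print(freshness_adjusted(1.0, "news", one_half_life_ago, now))
```

Because the multiplier is bounded to the interval [0.7, 1.0], an old news item can lose at most 30% of its score, so ranking stays dominated by the reranker's base score.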
Hyungi Ahn	7d0fca267d	feat(marker): auto-skip handwritten notes (reflects Phase 1D pilot results) In the 1D pilot (2026-05-02 overnight sweep, 25 controlled_backfill results), 3 handwritten PDFs (4798 / 4813 / 4815) were converted with status='success', but the user's quality review judged that no useful material could be extracted. The root cause is not insufficient Marker configuration but the input itself (Apple Pencil handwriting + the user's own handwriting style = beyond what OCR/layout models can handle). Since Marker tuning cannot solve this, such documents are skipped automatically at the enqueue stage. Guard logic: right after the doc_type SKIP in marker_worker.process() (step 1.5), call _set_skipped() when title/path matches one of 4 conservative keywords (필기, 손글씨, handwritten, handwriting). md_content/md_content_hash are cleared to NULL, md_extraction_error='skipped: handwritten note (title/path heuristic)', content_origin='extracted'. Keyword selection (conservative): Included: 필기 ("handwritten notes") / 손글씨 ("handwriting") / handwritten / handwriting Excluded (false-positive risk): - 노트 ("note"; would also catch normal non-handwritten documents such as 노트북 매뉴얼 / release notes / Note_240528_워크숍) - scan / 스캔 (some scanned PDFs convert fine; in the 1D results doc 5127 표준기계설계(KS)_08_핀 had density 1.59 / scan_likely yet succeeded) logger: markdown_skip_handwritten_hint id=<id> keyword=<matched> title=<...> All 15 regex unit-test cases (real production fastapi venv) pass: Match: Note_240805_용접교육 필기 / Note_240827_필기 / 손글씨 모음 / Handwritten Notes 2024 / handwriting practice / path/필기/* / path/handwritten_collection/* (8 cases) Non-match: 다이아프람워크숍 / 노트북 매뉴얼 / Release notes v2 / PIPE FABRICATORS / 표준기계설계 / scan documentation / 스캔 문서 (7 cases) This guard applies at enqueue time. The md_content of the 4 already-success documents is preserved (it can be shown if the user wants to view it directly). Cleanup, if needed, is a separate matter. Follow-ups (separate PRs): - A2 (formal doc_type='필기노트' label): the 3 samples from 1D are too few, so the label definition is on hold. To be reviewed separately once more handwritten PDFs accumulate. - C (Phase 2 full backfill plan): separate round after this PR merges. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 08:11:42 +09:00
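The keyword guard in 7d0fca267d can be approximated with a single case-insensitive regex. This is only a sketch of the heuristic: the names `HANDWRITTEN_RE` and `handwritten_hint` are hypothetical, and case-insensitive matching is inferred from the "Handwritten Notes 2024" match case rather than confirmed by the source.

```python
import re

# The 4 conservative keywords from the commit message. "노트", "scan" and
# "스캔" are deliberately left out because of their false-positive risk.
HANDWRITTEN_RE = re.compile(r"필기|손글씨|handwritten|handwriting", re.IGNORECASE)


def handwritten_hint(title, path):
    """Return the matched keyword from title or path, or None if neither matches."""
    for text in (title, path):
        match = HANDWRITTEN_RE.search(text or "")
        if match:
            return match.group(0)
    return None


if __name__ == "__main__":
    # Titles and paths taken from the commit's match / non-match lists.
    for title, path in [
        ("Note_240827_필기", ""),
        ("Handwritten Notes 2024", ""),
        ("untitled", "path/handwritten_collection/a.pdf"),
        ("노트북 매뉴얼", ""),
        ("Release notes v2", ""),
        ("scan documentation", ""),
    ]:
        print(title, path, handwritten_hint(title, path))
```

Returning the matched keyword rather than a boolean makes it easy to log it alongside the document id, in the spirit of the `keyword=<matched>` field of the logger line quoted in the commit.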

470 Commits