hyungi
12ac18eb70
fix(collector): harden collectors so a single failure no longer kills the whole cycle
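C2 csb_collector: wrap the per-URL loop of the weekly run in try/except/continue. Previously,
a failure on a single URL (a page-extract exception or a DB DataError) propagated out of run(),
so every subsequent URL was skipped and the watermark stopped advancing. Each iteration uses
its own session, so failures are isolated.
H3 news_collector: the old structure used a shared session with a single commit at the end, so
a DB error in one source poisoned the session and the inserts for all sources were lost. Switch
to an independent session per source (same pattern as csb). On failure, roll back and then
record the failure from a clean state.
Verified: in a manual collection run, a Taipei Times ReadTimeout was isolated and the run
completed normally with 327 items.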
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-20 05:42:12 +00:00
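
The isolation pattern described in the commit message can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: it uses the standard-library sqlite3 module in place of whatever DB layer the collectors use, and the names collect_all, init_db, fetch, and the articles/failures tables are all hypothetical. Each source gets its own connection, a failure rolls back only that source's partial inserts, and the failure record is written from a clean state before the loop moves on.

```python
import sqlite3


def init_db(db_path):
    """Create the two illustrative tables (hypothetical schema)."""
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE IF NOT EXISTS articles (source TEXT, title TEXT)")
    conn.execute("CREATE TABLE IF NOT EXISTS failures (source TEXT, error TEXT)")
    conn.commit()
    conn.close()


def collect_all(db_path, sources, fetch):
    """Collect every source, isolating failures to the source that raised.

    `fetch(source)` is an assumed callable that yields article titles and
    may raise (for example on a read timeout).
    """
    results = {"ok": [], "failed": []}
    for source in sources:
        # Independent connection per source: an error here cannot
        # poison the transaction of any other source.
        conn = sqlite3.connect(db_path)
        try:
            for title in fetch(source):
                conn.execute(
                    "INSERT INTO articles (source, title) VALUES (?, ?)",
                    (source, title),
                )
            conn.commit()
            results["ok"].append(source)
        except Exception as exc:
            # Discard this source's partial inserts, then record the
            # failure from a clean transaction state.
            conn.rollback()
            conn.execute(
                "INSERT INTO failures (source, error) VALUES (?, ?)",
                (source, repr(exc)),
            )
            conn.commit()
            results["failed"].append(source)
        finally:
            conn.close()
    return results
```

With a shared connection and a single commit at the end, the same exception would have left every source's inserts uncommitted; committing per source is what lets the remaining sources finish.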