hyungi_document_server/tests/fixtures/economist_latest_rss.xml
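hyungi 8583465c58 feat(news): crawl-24x7 cycle 3: B-4 signal, C-4 engineering (continued), CSB sitemap, CCPS Beacon (migration 327)
- B-4 fetch_method='signal-only': zero page fetches + summarize skipped (search indexing
  only, zero load on the Mac mini) + body left untruncated (_entry_body; preserves a 1.6K
  arXiv abstract). The digest excludes these entries naturally through its rule that drops
  rows with NULL ai_summary. Adds a defensive guard against a registry misconfiguration (page).
- Seeds 9 sources (all URLs verified live on 2026-06-11): Bloomberg Markets/Technology
  (skip-video; mixed-in video items observed), Economist Latest, Nikkei Asia (RDF; handled
  natively by feedparser, fixture captured to record that no code branch is needed),
  ASME JPVT (site_1000037 mapping confirmed by measurement), 2 arXiv feeds, 2 IEEE Spectrum
  feeds (feed-full; the feed description was measured to carry the full text, 7.9K to 14K
  characters).
- csb_collector: sitemap lastmod diff (weekly, Mon 06:50): watermark (selector_override)
  + incremental backfill capped at 40 per run + diff sanity 300 + report PDFs (/assets/,
  excluding recommendation) → extract pipeline. Initial bulk load = CLI --bulk.
- api_standards_collector: parses links from the announcement list (as measured: not a
  page diff; 10 detail URLs per page) → ingests only new detail pages (monthly, 5th at
  07:05). Initial backfill = CLI --bulk.
- ccps_collector: plain requests to aiche.org return 403 (measured, regardless of UA) →
  fetches the monthly Beacon PDF through a playwright-fetcher anonymous context + a newly
  added /download (base64) with referer/cookie inheritance (monthly, 5th at 07:20).
  If headless access is blocked, CrawlBlocked → surfaced in health (following the Le Monde
  PARK precedent).
- B-5 remainder: rdf/feed-reader-UA = measured and recorded as needing no code branch
  (Economist returns 200 with the Archiver UA). table-strip/gn-redirect: the relevant
  sources have not been onboarded yet, so they stay in the backlog.
- 24 new tests (9 fixtures captured live; economist/ieee trimmed to fewer items): 39 passed.
- Migration 327 is a single statement (beware of a number collision with the PKM track;
  327 is claimed by this track).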
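
The csb_collector bullet above (sitemap lastmod diff with a watermark, a cap of 40 per run, and a diff sanity of 300) could be sketched roughly as follows. This is not the project's implementation: `select_new_urls`, `parse_lastmod`, and `DiffTooLarge` are hypothetical names, and treating "diff sanity 300" as "abort when more than 300 URLs changed" is an assumption.

```python
from datetime import datetime, timezone
import xml.etree.ElementTree as ET

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
CAP_PER_RUN = 40   # "cap 40 per run" in the commit message
DIFF_SANITY = 300  # "diff sanity 300"; the abort semantics are an assumption


class DiffTooLarge(RuntimeError):
    """Hypothetical error: the diff is implausibly large, so the run stops."""


def parse_lastmod(text):
    """Parse a sitemap <lastmod> value into an aware UTC-comparable datetime."""
    # Normalize a trailing "Z", which fromisoformat rejects before Python 3.11.
    dt = datetime.fromisoformat(text.strip().replace("Z", "+00:00"))
    return dt if dt.tzinfo else dt.replace(tzinfo=timezone.utc)


def select_new_urls(sitemap_xml, watermark):
    """Return (urls_to_ingest, new_watermark) for a single run."""
    root = ET.fromstring(sitemap_xml)
    changed = []
    for url in root.findall("sm:url", SITEMAP_NS):
        loc = url.findtext("sm:loc", namespaces=SITEMAP_NS)
        lastmod = url.findtext("sm:lastmod", namespaces=SITEMAP_NS)
        if not loc or not lastmod:
            continue
        modified = parse_lastmod(lastmod)
        if modified > watermark:
            changed.append((modified, loc.strip()))

    if len(changed) > DIFF_SANITY:
        raise DiffTooLarge(f"{len(changed)} changed URLs exceeds {DIFF_SANITY}")

    # Oldest first, so a capped run moves the watermark forward step by step.
    changed.sort()
    batch = changed[:CAP_PER_RUN]
    # Keep entries that share the last timestamp; otherwise the strict ">"
    # comparison would skip them on the next run. This may exceed the cap.
    i = len(batch)
    while batch and i < len(changed) and changed[i][0] == batch[-1][0]:
        batch.append(changed[i])
        i += 1

    new_watermark = batch[-1][0] if batch else watermark
    return [loc for _, loc in batch], new_watermark
```

Sorting oldest-first is a design choice of this sketch: it lets repeated runs drain a backlog in order while the stored watermark only ever moves forward.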

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
2026-06-11 07:13:17 +09:00

72 lines
3.1 KiB
XML
<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
<title>
<![CDATA[Latest Updates]]>
</title>
<description>
<![CDATA[The most recent blogs and online articles from The Economist]]>
</description>
<link>https://www.economist.com/latest</link>
<pubDate>Wed, 10 Jun 2026 21:11:56 +0000</pubDate>
<lastBuildDate>Wed, 10 Jun 2026 21:11:56 +0000</lastBuildDate>
<atom:link href="https://www.economist.com/latest/rss.xml" rel="self" type="application/rss+xml"/>
<item>
<title>
<![CDATA[Syria is an unexpected beneficiary of the Gulf war]]>
</title>
<description>
<![CDATA[The revival of an old oil-export route from Iraq to the Mediterranean helps Syria’s new regime]]>
</description>
<link>https://www.economist.com/middle-east-and-africa/2026/06/10/syria-is-an-unexpected-beneficiary-of-the-gulf-war</link>
<guid isPermaLink="false">5737613e-c6cd-4cf0-b7da-fbfb52872f63</guid>
<pubDate>Wed, 10 Jun 2026 19:26:42 +0000</pubDate>
</item>
<item>
<title>
<![CDATA[How to win the World Cup]]>
</title>
<description>
<![CDATA[Being rich helps, but being open to immigration works best of all]]>
</description>
<link>https://www.economist.com/international/2026/06/10/how-to-win-the-world-cup</link>
<guid isPermaLink="false">1019df1e-5c1e-4784-ae0c-31741c176e41</guid>
<pubDate>Wed, 10 Jun 2026 19:07:01 +0000</pubDate>
</item>
<item>
<title>
<![CDATA[American capitalism is run by millionaires, not billionaires]]>
</title>
<description>
<![CDATA[They hide in plain sight—and wield enormous power]]>
</description>
<link>https://www.economist.com/business/2026/06/10/american-capitalism-is-run-by-millionaires-not-billionaires</link>
<guid isPermaLink="false">dbbcb101-a7de-472b-a62c-d969ab033b90</guid>
<pubDate>Wed, 10 Jun 2026 19:01:31 +0000</pubDate>
</item>
<item>
<title>
<![CDATA[New techniques can predict and prevent lung cancer ]]>
</title>
<description>
<![CDATA[A molecular signature can identify those most at risk]]>
</description>
<link>https://www.economist.com/science-and-technology/2026/06/10/new-techniques-can-predict-and-prevent-lung-cancer</link>
<guid isPermaLink="false">dbc7231c-6c7c-42fb-8930-bb099e1d3015</guid>
<pubDate>Wed, 10 Jun 2026 18:48:35 +0000</pubDate>
</item>
<item>
<title>
<![CDATA[The World Cup has always been beset by scandal and strife]]>
</title>
<description>
<![CDATA[So has FIFA, the outfit that administers it]]>
</description>
<link>https://www.economist.com/international/2026/06/10/the-world-cup-has-always-been-beset-by-scandal-and-strife</link>
<guid isPermaLink="false">f2213e72-3531-4894-a33f-47bce2fea4e9</guid>
<pubDate>Wed, 10 Jun 2026 18:25:19 +0000</pubDate>
</item>
</channel>
</rss>
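
A test could read a fixture shaped like the one above along these lines. The commit message says the project relies on feedparser; this sketch swaps in the standard library's `xml.etree.ElementTree` so it runs without dependencies. `parse_rss_items` and `SAMPLE` are hypothetical names, and `SAMPLE` is a one-item excerpt of the fixture, not the whole file.

```python
from email.utils import parsedate_to_datetime
import xml.etree.ElementTree as ET

# One-item excerpt in the same shape as the fixture (CDATA on its own line).
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<channel>
<title>
<![CDATA[Latest Updates]]>
</title>
<item>
<title>
<![CDATA[How to win the World Cup]]>
</title>
<link>https://www.economist.com/international/2026/06/10/how-to-win-the-world-cup</link>
<guid isPermaLink="false">1019df1e-5c1e-4784-ae0c-31741c176e41</guid>
<pubDate>Wed, 10 Jun 2026 19:07:01 +0000</pubDate>
</item>
</channel>
</rss>
"""


def parse_rss_items(xml_text):
    """Return one dict per <item>, with whitespace stripped and pubDate parsed."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        # CDATA sits on its own line in the fixture, so strip the newlines.
        title = (item.findtext("title") or "").strip()
        link = (item.findtext("link") or "").strip()
        guid = (item.findtext("guid") or "").strip()
        pub = item.findtext("pubDate")
        published = parsedate_to_datetime(pub.strip()) if pub else None
        items.append(
            {"title": title, "link": link, "guid": guid, "published": published}
        )
    return items


if __name__ == "__main__":
    for entry in parse_rss_items(SAMPLE):
        print(entry["guid"], entry["published"], entry["title"])
```

Because every guid in this feed has `isPermaLink="false"`, the guid rather than the link is the likelier deduplication key, though which key the collector actually uses is not stated in the source.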