46 lines
23 KiB
Plaintext
46 lines
23 KiB
Plaintext
ulw
|
||
现在在初学记文件夹下,有初学记全部内容的eupb文件。你需要提取他们为一个结构化的json文件。具体做法是,你需要先将eupb文件转为zip文件,zip文件夹下有OPS文件夹,存有各卷内容的html文件。下面以第一卷为例:
|
||
```html
|
||
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
|
||
<!DOCTYPE html>
|
||
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="zh" dir="ltr"><head><meta charset="UTF-8"/><link type="text/css" rel="stylesheet" href="main.css"/><title>卷一</title><style typeof="mw:Extension/templatestyles" about="#mwt2"><![CDATA[.mw-parser-output .kaiti,.mw-parser-output .Kaiti,.mw-parser-output .KaiTi,.mw-parser-output .template-kai,.mw-parser-output .template-kai div,.mw-parser-output .template-kai p{font-family:"Kaiti SC",TH-Khaai-TP0,TH-Khaai-TP2,TH-Feon-A,楷体,KaiTi,楷体_GB2312,KaiTi_GB2312,FandolKai,华文楷体,STKaiti,TH-Khaai-PP0,TH-Khaai-PP2,Kai,"Kaiti TC",BiauKai,"AR PL UKai CN",標楷體,DFKai-SB,"AR PL UKai HK","AR PL UKai TW",全字庫正楷體,TW-Kai,EUDCKAI,cursive}.mw-parser-output .template-kai:lang(zh-hant),.mw-parser-output .template-kai:lang(zh-hk),.mw-parser-output .template-kai:lang(zh-mo),.mw-parser-output .template-kai:lang(zh-tw),.mw-parser-output .template-kai:lang(nan-tw),.mw-parser-output .template-kai:lang(hak-tw){font-family:"Kaiti TC",標楷體,DFKai-SB,"AR PL UKai HK","AR PL UKai TW",BiauKai,Kai,全字庫正楷體,TW-Kai,"Kaiti SC",TH-Khaai-PP0,TH-Khaai-PP2,TH-Khaai-TP0,TH-Khaai-TP2,TH-Feon-A,楷体,KaiTi,楷体_GB2312,KaiTi_GB2312,FandolKai,华文楷体,STKaiti,"AR PL UKai CN",EUDCKAI,cursive}]]></style><style typeof="mw:Extension/templatestyles mw:Transclusion" about="#mwt4"><![CDATA[.mw-parser-output .licenseContainer{box-sizing:border-box;margin-top:1em;margin-bottom:0.25em;clear:both;width:auto;page-break-before:always;break-before:page}.mw-parser-output .licenseContainer>div:first-child{display:table;border:2px solid var(--border-subtle,#8888aa);border-collapse:collapse;border-spacing:0 0;empty-cells:hide;box-sizing:border-box;margin:0 auto 0 auto;width:100%;background-color:var(--background-color-neutral-subtle,#f7f8ff);color:var(--color-base,#202122)}.mw-parser-output .licenseContainer>div:first-child>div:first-child{display:table-row-group}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child{display:table-row}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div{display:table-cell;padding:5px;vertical-align:middle;width:auto}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:first-child{text-align:left}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2){text-align:center}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div{width:100%}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:first-child{display:block;margin:0 auto 0 auto;text-align:left}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:nth-child(2){display:table;border-top:1px solid var(--border-subtle,#8888aa)}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:nth-child(2)>div{display:table-cell;vertical-align:middle}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:nth-child(2)>div:first-child{text-align:center;padding:5px 0;width:40px}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:nth-child(2)>div:nth-child(2){text-align:left;padding:5px;font-size:92%}.mw-parser-output .licenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(3){text-align:right}.mw-parser-output .licensetpl{display:none}.mw-parser-output .licenseContainer.wst-collapsible-box{clear:both;margin:0.25em 0 0.25em 0;font-size:95%;background-color:var(--background-color-neutral-subtle,#f7f8ff);color:var(--color-base,#202122);border:2px solid var(--border-subtle,#8888aa);text-align:left;line-height:1.6}.mw-parser-output .wst-license-container-title{display:inline-block;padding:0.5em}.mw-parser-output .wst-license-container-content{background:transparent;color:inherit;margin:0;padding:3px;text-align:left}.mw-parser-output .licenseContainer.warningLicenseContainer>div:first-child{border:2px solid var(--border-color-error,#b22222);background-color:var(--background-color-error-subtle,#ffeeee);color:var(--color-base,#202122)}.mw-parser-output .licenseContainer.warningLicenseContainer>div:first-child>div:first-child>div:first-child>div:nth-child(2)>div:nth-child(2){border:2px solid var(--border-color-error,#b22222);color:var(--color-base,#202122)}]]></style></head><body class="mw-content-ltr sitedir-ltr ltr mw-body-content parsoid-body mediawiki mw-parser-output" dir="ltr" data-mw-parsoid-version="0.23.0.0-alpha12" data-mw-html-version="2.8.0" xml:lang="zh"><section data-mw-section-id="0"><table style="width:100%; margin-top:0px;border:1px solid #299ec9; background-color: #93d7f0; text-align:center;" about="#mwt1" typeof="mw:Transclusion">
|
||
<tbody><tr>
|
||
<td class="noprint" style="width:0; text-align:left; font-size:small;"/>
|
||
<td style="width:50%;">初學記 卷一</td>
|
||
<td class="noprint" style="width:25%; text-align:right; font-size:small;"><a rel="mw:WikiLink" href="c2_chu_xue_ji__si_ku_quan_shu_ben__juan02.xhtml" title="初學記 (四庫全書本)/卷02">卷二</a> <span style="color:#299ec9">→</span></td></tr>
|
||
</tbody></table><span about="#mwt1">
|
||
</span><div class="kaiti" style="overflow: auto; height:90%; width:100%; writing-mode: vertical-rl; -webkit-writing-mode: vertical-rl; writing-mode: tb-rl; layout-flow: vertical-ideographic; overflow:auto; float:right; font-size:150%; *display: inline; overflow: auto; padding: 9px; " about="#mwt1">
|
||
<meta typeof="mw:Includes/OnlyInclude"/><div class="poem" typeof="mw:Extension/poem" about="#mwt303"><p> 欽定四庫全書<br/>
|
||
<span id="chu_xue_ji_juan_yi-n464" about="#mwt6" typeof="mw:Transclusion"><a rel="mw:WikiLink" href="https://zh.wikisource.org/wiki/初學記_(四庫全書本)/卷01#chu_xue_ji_juan_yi-n464" class="mw-selflink-fragment">初學記卷一</a></span><br/>
|
||
唐 徐堅 撰<br/>
|
||
<span id="tian_bu-n465" about="#mwt7" typeof="mw:Transclusion"><a rel="mw:WikiLink" href="https://zh.wikisource.org/wiki/初學記_(四庫全書本)/卷01#tian_bu-n465" class="mw-selflink-fragment">天部</a></span><br/>
|
||
天第一<small about="#mwt8" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>叙事<span style="color:transparent;font-size:0px">〉</span></small>河圖括地象云易有太極是生兩儀兩儀未分其氣混沌清濁既分伏者為天偃者為地釋名云天坦也坦然髙而逺也物理論云水土之氣升為天爾雅云春為蒼天夏為昊天秋為旻天冬為上天廣雅云南方曰炎天西南方曰朱天西方曰成天西北方曰幽天北方曰𤣥天東北方曰變天<small about="#mwt9" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>九天亦名九野<span style="color:transparent;font-size:0px">〉</span></small>九天之際曰九垠<small about="#mwt10" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>魚勤反堮也<span style="color:transparent;font-size:0px">〉</span></small>九天之外次曰九陔<small about="#mwt11" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>居核反陔階也言其階次有九<span style="color:transparent;font-size:0px">〉</span></small>凡天去地二億一萬六千七百八十一里半度地之厚與天髙等天南北相去二億三萬三千五十七里二十五歩東西短減四歩纂要云東西南北曰四方四方之隅曰四維天地四方曰六合天地曰二儀以人參之曰三才四方上下謂之宇往古來今謂之宙或謂天地為宇宙凡天地元氣之所生天謂之乾地謂之坤天圓而色𤣥地方而色黄日月謂之兩曜五星謂之五緯<small about="#mwt12" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>五星者東方嵗南方熒惑西方太白北方辰中央鎮<span style="color:transparent;font-size:0px">〉</span></small>日月星謂之三辰亦曰三光日月五星謂之七曜天河謂之天漢<small about="#mwt13" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>亦曰雲漢星漢河漢清漢銀漢天津漢津淺河銀河絳河<span style="color:transparent;font-size:0px">〉</span></small>五經通義云天神之大者曰昊天上帝<small about="#mwt14" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>即曜魄寳也亦曰天皇大帝亦曰太一<span style="color:transparent;font-size:0px">〉</span></small>其佐曰五帝<small about="#mwt15" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>東方青帝靈威仰南方赤帝赤熛怒西方白帝白招拒北方黑帝叶光紀中央黄帝含樞紐<span style="color:transparent;font-size:0px">〉</span></small>事對轉葢 倚杵<small about="#mwt16" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>桓譚新論天如葢轉左旋日月星辰随而東西河圗挺佐輔曰百世之後地髙天下如此千嵗之後而天可倚杵洶洶莫知始終<span style="color:transparent;font-size:0px">〉</span></small>覆盆 轉轂<small about="#mwt17" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>王充論衡曰天平與地無異若覆盆之狀渾天儀曰二十八宿半隠半見天轉如車轂之運<span style="color:transparent;font-size:0px">〉</span></small>象葢 如笠<small about="#mwt18" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>劉氏正歴問曰顓頊造渾天儀黄帝為葢天以天象葢虞昺穹天論曰天形如笠而冒地之表<span style="color:transparent;font-size:0px">〉</span></small>玉儀 銅渾<small about="#mwt19" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>尚書考靈曜曰觀玉儀之旋昏明主時鄭注曰以玉為渾儀故曰玉儀名臣奏曰今史官所用𠉀臺銅儀則渾天法也述征記曰長安南有靈臺上有銅渾天儀<span style="color:transparent;font-size:0px">〉</span></small>設位 垂象<small about="#mwt20" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>易曰天地設位而易行乎其中又曰天垂象見吉凶聖人象之<span style="color:transparent;font-size:0px">〉</span></small>髙明 貞觀<small about="#mwt21" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>禮記曰天地之道博也厚也髙也明也悠也乆也鄭注曰此言其善見功成易曰天地之道貞觀者也日月之道貞明者也<span style="color:transparent;font-size:0px">〉</span></small>三體 六氣<small about="#mwt22" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>蔡邕天文志言天體者三一曰周髀二曰宣夜三曰渾天左傳曰天有六氣降生五味杜預注曰六氣者隂陽風雨晦明<span style="color:transparent;font-size:0px">〉</span></small>四極 九野<small about="#mwt23" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>淮南子曰昔者女媧氏鍊五色石以補蒼天斷鼇足以立四極髙誘注曰鼇大龜也天廢傾以鼇足柱之九野見上<span style="color:transparent;font-size:0px">〉</span></small>九重 八柱<small about="#mwt24" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>楚詞曰圓則九重孰營度之王逸注曰言天圓而九重誰營度而知之又八柱何當東南何虧注曰言天有八山為柱也<span style="color:transparent;font-size:0px">〉</span></small>折柱 絶維<small about="#mwt25" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>列子曰共工氏與顓頊争為帝怒觸不周山折天柱絶地維故天傾西北日月星辰就焉地缺東南百川水潦歸焉宋玉大言賦曰壯士歘兮絶天維北斗戾兮太山夷<span style="color:transparent;font-size:0px">〉</span></small>雨粟 降秬<small about="#mwt26" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>周書曰神農之時天雨粟農耕而種之孫氏瑞應圖曰舜時后稷播植天降秬秠故詩曰天降嘉種惟秬惟秠<span style="color:transparent;font-size:0px">〉</span></small>降麰 下榖<small about="#mwt27" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>漢書曰来麰大麥也始自天降以致和復天助也孔叢子曰魏王問子順曰寡人聞昔者上天神異后稷而為之下嘉穀周遂以興<span style="color:transparent;font-size:0px">〉</span></small><span about="#mwt28" typeof="mw:Transclusion">𣏌</span>國憂 秦宓答<small about="#mwt29" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>列子曰杞國昔有人憂天崩墜身無所寄廢於寢食又有憂彼憂者曉之曰天積氣耳奈何而崩墜乎其人曰天果積氣日月星宿不當墜也曉者曰日月星是氣中之有光耀者正復使墜亦不能有傷蜀志曰呉使張温来聘温問秦宓曰天有頭乎宓曰有之温曰在何方宓曰詩云乃眷西顧以此推之頭在西方温曰天有耳乎宓曰天處髙而聽卑詩云鶴鳴九臯聲聞于天若其無耳何以聽之温曰天有足乎宓曰詩云天步艱難若其無足何以𡵯之温曰天有姓乎宓曰姓劉曰何以知之宓曰其子姓劉以此知之<span style="color:transparent;font-size:0px">〉</span></small>命虞 啟魏<small about="#mwt30" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>史記曰叔虞母夢天謂武王曰余命汝生子名虞余與之虞及生子有文在手曰虞遂因命之左傳晉侯賜畢萬魏卜偃曰畢萬之後必大萬盈數也魏大名也以是始賞天啟之矣<span style="color:transparent;font-size:0px">〉</span></small>授楚 錫秦<small about="#mwt31" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>左傳曰公孫歸父㑹楚子於宋宋人告急于晉晉侯欲救之伯宗曰不可天方授楚未可與争雖晉之强能違天乎張衡西京賦曰昔者大帝說秦穆公而覲之乃為金冊錫用此土而剪諸鶉首以上直載天<span style="color:transparent;font-size:0px">〉</span></small>油雲 膏雨<small about="#mwt32" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>孟子曰油然作雲霈然下雨左傳小國之仰大國也如百穀之仰膏雨<span style="color:transparent;font-size:0px">〉</span></small>榆星 桂月<small about="#mwt33" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>古樂府詩曰天上何所有歴歴種白榆虞喜安天論曰俗傳月中仙人桂樹今視其初生見仙人之足漸已成形桂樹後生<span style="color:transparent;font-size:0px">〉</span></small>璧月 珠露<small about="#mwt34" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>尚書中𠉀曰天地開闢甲子冬至日月若懸璧五星若編珠李顒感興賦曰風觸波而文結兮露霑丹而珠凝<span style="color:transparent;font-size:0px">〉</span></small>紫電 文虹<small about="#mwt35" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>曹毗霖雨詩曰洪霖彌旬日翳翳四區昏紫電光飛牖迅雷終天奔傅𤣥陽春賦曰習習谷風洋洋綠泉丹霞布景文虹竟天<span style="color:transparent;font-size:0px">〉</span></small>繒雲 絲雨<small about="#mwt36" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>易通卦驗曰立秋燭隂雲出如赤繒張協詩曰金風扇景節丹霞啟隂期騰雲似漏網宻雨如散絲<span style="color:transparent;font-size:0px">〉</span></small>風駟 雲車<small about="#mwt37" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>仲長統詩曰春雲為馬秋風為駟按之不遲勞之不疾魏武帝古樂府詩曰願得神之人乗駕雲車驂白鹿上到天之門来賜神之藥<span style="color:transparent;font-size:0px">〉</span></small>錦雲 縠霧<small about="#mwt38" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>成公綏雲賦曰或繡文錦章依㣲要妙宋玉神女賦曰動霧縠以徐步拂珮聲之珊珊<span style="color:transparent;font-size:0px">〉</span></small>文露 光風<small about="#mwt39" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>春秋佐助期曰武露布文露沉宋均注曰甘露見其國布散者人尚武文采者則甘露凝重楚詞曰川谷徑復流潺湲光風轉蕙汎崇蘭王逸注曰謂雨已出日而風草木有光也<span style="color:transparent;font-size:0px">〉</span></small>祥風 甘雨<small about="#mwt40" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>尚書大傳曰徳及皇天則祥風起括地圗曰谷山有叢雲甘雨<span style="color:transparent;font-size:0px">〉</span></small>翠雲 紫蜺<small about="#mwt41" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>馮衍明志賦曰駟素蚪而馳騁兮乘翠雲而相羊<span typeof="mw:File"><span><img alt="揚 --(『昜』上『旦』之『日』與『一』相連)" resource="./File:SKQSfont.pdf" src="images/c34_SKQSfont.pdf_page3951_20px_SKQSfont.pdf.jpg" decoding="async" data-file-type="office" class="mw-file-element" style="width:20px; height:20px; " data-title="SKQSfont.pdf-page3951-20px-SKQSfont.pdf.jpg"/></span></span>雄太𤣥經曰紫蜺圍日其疾不割<span style="color:transparent;font-size:0px">〉</span></small>姮娥月 少女風<small about="#mwt42" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>淮南子曰羿請不死之藥於西王母羿妻姮娥竊之奔月託身於月是為蟾蠩而為月精管公明别傳曰公明在清河于時大旱問何時雨言今夜當大雨至日向暮了無雲氣衆人並讙嗤公明公明言樹上已有少女微風樹間隂鳥和鳴若少女反風隂鳥亂翔其應至矣須㬰𤣥雲四集大雨注傾<span style="color:transparent;font-size:0px">〉</span></small>白鶴雲 黄雀風<small about="#mwt43" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>易通卦驗曰立春青陽雲出房如積水春分正陽雲出張如白鶴周處風土記曰五月大雨名為濯枝五月風發六日乃止黄雀風是時海魚變為黄雀因以名之以上總載天<span style="color:transparent;font-size:0px">〉</span></small>賦晉成公綏天地賦<small about="#mwt44" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>天地至神難以一言定其稱故體而言之則曰兩儀性而言之則曰柔剛色而言之則曰𤣥黄名而言之則曰天地若乃懸象成文列宿有章三辰燭曜五緯重光衆星回而環極招揺運而指方白虎峙據於參昴青龍垂尾於心房𤣥龜匿首於女虛朱雀奮翼於軫張垣屏絡繹而珠連三台差池而鴈行軒轅華布而曲列攝提鼎峙而相望<span style="color:transparent;font-size:0px">〉</span></small>詩晉傅𤣥兩儀詩<small about="#mwt45" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>兩儀始分元氣清列宿垂象六位成日月西流景東征悠悠萬物殊品名聖人憂代念羣生<span style="color:transparent;font-size:0px">〉</span></small>又歌天詩<small about="#mwt46" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>天行一何健日月無髙踪百川皆赴海三辰回泰蒙<span style="color:transparent;font-size:0px">〉</span></small>梁劉孝綽三光篇<small about="#mwt47" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>三光垂表象天地有晷度聲和善響應形立景自附素日抱𤣥烏明月懐靈兎<span style="color:transparent;font-size:0px">〉</span></small>陳張正見賦得秋河曙耿耿詩<small about="#mwt48" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>耿耿長河曙濫濫宿雲浮天路横秋水星衡轉夜流月下姮娥落風驚織女秋徳星猶可見仙槎不復留<span style="color:transparent;font-size:0px">〉</span></small>宋之問明河篇<small about="#mwt49" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>八月凉風天氣晶萬里無雲河漢明昏見南樓清且淺曉落西山縦復横洛陽城闕天中起長河夜夜千門裏複道連甍共蔽虧畫堂瓊户特相宜雲母屏前初汎濫水精簾外轉逶迤倬彼昭回如練白復出東城接南陌南陌征人去不歸誰知今夜𢷬寒衣鴛鴦綺上疎螢度烏鵲橋邉一鴈飛鴈飛螢度愁難歇坐見河傾漸微沒已能舒卷任浮雲不惜光輝譲明月明河可望不可親願得乗槎一問津更將織女支機石還訪成都賣卜人<span style="color:transparent;font-size:0px">〉</span></small>讚郭璞釋天地圗讚<small about="#mwt50" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>祭地肆瘞郊天致煙氣升太一精淪九泉至敬不文明徳惟鮮<span style="color:transparent;font-size:0px">〉</span></small>宋何承天天讚<small about="#mwt51" typeof="mw:Transclusion"><span style="color:transparent;font-size:0px">〈</span>軒轅改物以經天人容成造歴大撓創辰龍集有次星紀乃分<span style="color:transparent;font-size:0px">〉</span></small><br/>
|
||
```
|
||
你需要提取出以下信息:
|
||
卷:01;词条:天;叙事:“河圖括地象云易有太極是生兩儀兩儀未分”;事对:“”;诗文:“”;(叙事、事对、诗文中可能有一些解释说明的小字,你需要括起来)
|
||
就像上面显示的,在各卷的xhtml文件中,你需要自行解析文件结构。大致结构为每一卷中有一些部,每一部中有一些词条,如 卷一 天部 的词条有天第一,日第二,月第三,星第四 等,每一词条下的内容可分为“叙事”、“事对”和“诗文”等,具体情况视各词条而定。最终需要将全部二十二卷整合为一个json文件,主键为各词条,格式为:
|
||
{
|
||
"metadata": {
|
||
"title": "初学记",
|
||
"author": "徐坚",
|
||
"dynasty": "唐",
|
||
"total_volumes": 30,
|
||
"source": "2026年1月28日从维基文库导出"
|
||
},
|
||
"preface": (如有),
|
||
"categories": {
|
||
"天": [
|
||
{
|
||
"volume": 01,
|
||
"section": "天部",
|
||
"content":{
|
||
"叙事": "...";
|
||
"事对": "...";
|
||
"诗文": "..."
|
||
}
|
||
}
|
||
],
|
||
……
|
||
}
|
||
}
|
||
当前文件夹下,还有一个之前整理的,但没整理好的半成品。你可以在此基础上整理,也可以删除之另行整理。 |