{
  "version": "1.0",
  "source_chunks_path": "01_chunks.json",
  "strategy": {
    "front_ratio": 0.15,
    "middle_positions": [
      0.25,
      0.5,
      0.75
    ],
    "tail_ratio": 0.1,
    "high_density_limit": 30,
    "front_cap": 12,
    "tail_cap": 8,
    "middle_radius": 1,
    "preview_chars": 220
  },
  "samples": [
    {
      "sample_id": "OS0001",
      "chunk_id": "C0001",
      "start_para": 1,
      "end_para": 5,
      "sampling_reason": "front_loaded",
      "char_len": 540,
      "text_preview": "第一回 望成名學究訓頑兒 講制藝鄉紳勖後進 ---------------------------------------- 話說陝西同州府朝邑縣，城南三十四地方，原有一個村莊。這莊內住的衹有趙、方二姓，并無他族。這莊叫小不小，叫大不大，也有二三十戶人家。祖上世代務農。到了姓趙的爺爺手裏，居然請了先生，教他兒子攻書，到他孫子，忽然得中一名黌門秀士 。鄉里人眼淺，看見中了秀才，竟是非同小可，合莊的人，都把他推戴起來，姓方的便漸漸的不敵了。...",
      "chunk_index": 1,
      "all_reasons": [
        "front_loaded",
        "high_entity_density"
      ],
      "density_score": 0.135,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      }
    },
    {
      "sample_id": "OS0002",
      "chunk_id": "C0002",
      "start_para": 5,
      "end_para": 7,
      "sampling_reason": "front_loaded",
      "char_len": 1818,
      "text_preview": "----- \"開講\"：指八股文中的第三段，為初學寫八股文的人所為。且說是年正值\"大比之年\"，那姓趙的便送孫子去趕大考。考罷回家，天天望榜，自不必說。到了重陽過後，有一天早上，大家方在睡夢之中，忽聽得一陣馬鈴聲響，大家被他驚醒。開門看處，衹見一群人，簇擁著向西而去。仔細一打聽，都說趙相公考中了舉人了。此時方必開也隨了大眾在街上看熱鬧，得了這個信息，連忙一口氣跑到趙家門前探望。衹見有一群人，頭上戴著紅纓帽子，正忙著在那裡貼報條呢。方必開自從...",
      "chunk_index": 2,
      "all_reasons": [
        "front_loaded",
        "high_entity_density"
      ],
      "density_score": 2.2,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "拉翰林"
        ],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [
          "拉翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      }
    },
    {
      "sample_id": "OS0003",
      "chunk_id": "C0003",
      "start_para": 7,
      "end_para": 13,
      "sampling_reason": "middle_stratified",
      "char_len": 1887,
      "text_preview": "----- 拉翰林：考取的進士除一甲三名，照例授職翰林院外，其他還參加朝考，由皇帝圈點成績優秀者為翰林院庶吉士。那時候，方必開聽了先生教他兒子的一番話，心上一時歡喜，喉嚨裏的痰也就活動了許多，後來又聽見先生說什麼做了官就有錢賺，他就哇的一聲，一大口的粘痰嘔了出來。剛剛吐得一半，忽然又見他兒子回駁先生的幾句話，駁的先生頓口無言，他的痰也就擱在嘴裏頭，不往外吐了，直鉤鉤兩衹眼睛，瞅著先生，看他拿什麼話回答學生。衹見那王仁楞了好半天，臉上紅一...",
      "chunk_index": 3,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 2.2,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "拉翰林"
        ],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [
          "拉翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      }
    },
    {
      "sample_id": "OS0004",
      "chunk_id": "C0004",
      "start_para": 13,
      "end_para": 19,
      "sampling_reason": "middle_stratified",
      "char_len": 2119,
      "text_preview": "----- 龍門：指鄉試考場的二門，也有指第三門，其意是跨過這門就可一舉成爺兒兩個正在屋裏講話。忽然外面一片人聲吵鬧。問是甚麼事情，衹見趙溫的爺爺滿頭是汗，正在那裡跺著腳罵廚子，說：\"他們到如今還不來！這些王八崽子，不吃好草料的！停會子告訴王鄉紳，一定送他們到衙門裏去！\"嘴裏罵著，手裏拿著一頂大帽子，借他當扇子扇，搖來搖去，氣得眼睛都發了紅了。正說著，衹見廚子挑了碗盞家伙進來。大家拿他抱怨。廚名，取\"鯉魚跳龍門\"的意思。子回說：\"我的爺...",
      "chunk_index": 4,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 2.2,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "翰林"
        ],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [
          "翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      }
    },
    {
      "sample_id": "OS0005",
      "chunk_id": "C0005",
      "start_para": 19,
      "end_para": 22,
      "sampling_reason": "middle_stratified",
      "char_len": 1148,
      "text_preview": "----- 制藝：指八股文。經濟：經邦濟世、治理國家。王鄉紳一聽此言，不禁眉飛色舞，拿手向王孝廉身上一拍，說道：\"對了，老侄，你能夠說出這句話來，你的文章也著實有工夫了。現在我雖不求仕進，你也無意功名，你在鄉下授徒，我在城中掌教，一樣是替路先生宏宣教育，替我聖朝培養人才。這裡頭消長盈虛，關系甚重。老侄你自己不要看輕，這個重擔，卻在我叔侄兩人身上，將來維持世運，歷劫不磨。趙世兄他目前雖說是新中舉，總是我們斯文一脈，將來昌明聖教，繼往開來，...",
      "chunk_index": 5,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 0.287,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      }
    },
    {
      "sample_id": "OS0006",
      "chunk_id": "C0006",
      "start_para": 23,
      "end_para": 23,
      "sampling_reason": "middle_stratified",
      "char_len": 2192,
      "text_preview": "---------------------------------------- 話說趙家中舉開賀，一連忙了幾天，便有本學老師叫門鬥 傳話下來，叫趙溫即日赴省，填寫親供 。當下爺兒三代，買了酒肉，請門鬥飽餐一頓，又給了幾百銅錢。門鬥去後，趙溫便躊躇這親供如何填法，幸虧請教了老前輩王孝廉，一五一十的都教給他。趙溫不勝之喜。他爺爺又向親家方必開商量，要請王孝廉同到省城去走一遭，隨時可以請教。方必開一來迫于太親翁之命，二來是他女兒大伯子中舉的...",
      "chunk_index": 6,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 4.0,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "可便道來城",
          "一直進城"
        ],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [
          "可便道來城",
          "一直進城"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 2
        }
      }
    },
    {
      "sample_id": "OS0007",
      "chunk_id": "C0007",
      "start_para": 23,
      "end_para": 23,
      "sampling_reason": "middle_stratified",
      "char_len": 2133,
      "text_preview": "所以反不及他做典史的，倒可以事事躬親，實事求是。老侄，你想他這話，是一點不錯的呢。這人做官倒著實有點才幹，的的確確是位理財好手。\"王孝廉道：\"俗話說的好，'千裏為官衹為財'。\"王鄉紳道：\"正是這話。現在我想明年趙世兄上京會試，倒可叫他跟著我們內兄一路前去，諸事托他招呼招呼，他卻是很在行的。\"王孝廉道：\"這是最好的，還有什麼說得。\"當下王孝廉見王鄉紳眼睛不睬趙溫，瞧他坐在那裡沒得意思，就把這話告訴他一遍。趙溫除了說\"好\"之外，亦沒有別的話...",
      "chunk_index": 7,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 2.4,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [],
        "titled_people": [],
        "organizations": [
          "隨時指教"
        ],
        "artifacts": [],
        "term_highlights": [
          "隨時指教"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 1,
          "artifacts": 0,
          "conservative_terms": 1
        }
      }
    },
    {
      "sample_id": "OS0008",
      "chunk_id": "C0008",
      "start_para": 23,
      "end_para": 23,
      "sampling_reason": "middle_stratified",
      "char_len": 2199,
      "text_preview": "\"趙溫果然聽話，便捧了文章進來，在煙鋪空的一邊躺下，嘴裏還是念個不了，錢典史卻不便阻他，自己呼了幾口煙，又吃些水果、于點心之類，又拿起茶壺，就著壺嘴抽上兩口，把壺放下，順手拎過一支紫銅水煙袋，坐在床沿上吃水煙，一個吃個不了。後來，錢典史被他噪聒的實在不耐煩，便借著賀根來出氣。先說他偷懶不肯做事，後來又說他今天在路上買饅頭，四個錢一個，他硬要五個半錢一個，十二個饅頭，便賺了十八了錢，真真是混帳東西！頭裏賀根聽見舅老爺說他偷懶，已經滿肚皮不...",
      "chunk_index": 8,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 4.0,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "既不比做州",
          "我這番出山"
        ],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [
          "既不比做州",
          "我這番出山"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 2
        }
      }
    },
    {
      "sample_id": "OS0009",
      "chunk_id": "C0009",
      "start_para": 23,
      "end_para": 23,
      "sampling_reason": "middle_stratified",
      "char_len": 2194,
      "text_preview": "這裡趙溫會著幾個同年，把一應投文復試的事，都托了一位同年替他帶辦，免得另外求人，倒也省事不少。不過大幫復試已過，直好等到二十八這一天，同著些後來的在殿廷上復的試，居然取在三等裏面，奉旨準他一體會試。趙溫便高興的了不得，寫信稟告他爺爺、父親知道。這裡自從到京，頭一樁忙著便是拜老師。趙溫請教了同年，把貼子寫好，又封了二兩銀子的贄見，四吊錢的門包。他老師吳贊善，住在順治門外，趙、錢二位卻住在米市胡同，相去還不算遠。這天趙溫起了一個大早，連累了...",
      "chunk_index": 9,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 0.4,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      }
    },
    {
      "sample_id": "OS0010",
      "chunk_id": "C0010",
      "start_para": 23,
      "end_para": 25,
      "sampling_reason": "middle_stratified",
      "char_len": 719,
      "text_preview": "\"趙溫一定要他去，賀根推頭天還早，一定要歇一會子再去。主僕兩個就拌起嘴來。還是錢典史聽不過，爬起來幫著趙溫吆喝了兩句，他才嘰哩咕嚕的一路罵了出去。這一天，趙溫就同熱鍋上的螞蟻一般，茶飯無心，坐立不定。到得下午，便有人來說，誰又中了，誰又中了。偏生賀根從天不亮出去，一直到晚不曾回來。趙溫急的跳腳，等到晚上，街上人說榜都填完了，衹等著\"填五魁 \"了。賀根知道沒了指望，方才回寓。填五魁：五魁，即五經魁，鄉試的前五名，在發榜時是最後從第五名倒填...",
      "chunk_index": 10,
      "all_reasons": [
        "middle_stratified",
        "high_entity_density"
      ],
      "density_score": 0.18,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [],
        "titled_people": [],
        "organizations": [],
        "artifacts": [],
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      }
    },
    {
      "sample_id": "OS0011",
      "chunk_id": "C0011",
      "start_para": 26,
      "end_para": 26,
      "sampling_reason": "tail_stratified",
      "char_len": 2198,
      "text_preview": "---------------------------------------- 話說趙溫自從正月出門到今，不差已將三月。衹因離家日久，千般心緒，萬種情懷，正在無可排遣，恰好春風報罷，即擬整頓行裝，起身回去。不料他爺爺望他成名心切，寄來一封書信，又匯到二千多兩銀子，書上寫著：\"倘若聯捷，固為可喜；如其報罷，即趕緊捐一中書，在京供職。\"信上并寫明是王鄉紳的主意，\"所以東拼西湊，好容易弄成這個數目。望你好好在京做官。你在外面做官，家裏便免得...",
      "chunk_index": 11,
      "all_reasons": [
        "tail_stratified",
        "high_entity_density"
      ],
      "density_score": 3.8,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "指知府"
        ],
        "titled_people": [],
        "organizations": [
          "指知府"
        ],
        "artifacts": [],
        "term_highlights": [
          "指知府"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 1,
          "artifacts": 0,
          "conservative_terms": 1
        }
      }
    },
    {
      "sample_id": "OS0012",
      "chunk_id": "C0012",
      "start_para": 26,
      "end_para": 26,
      "sampling_reason": "tail_stratified",
      "char_len": 2195,
      "text_preview": "告訴他，替他墊了一百兩銀子，起先徐家裏還不肯寫，後來看我面上卻不過，他才寫的。靴掖子：皮或緞子做的夾子，放在靴筒裏。四恒：清末四大銀號，都以\"恒\"字為名。錢典史自是感激不盡，忙著連夜收拾行李，打算後天長行，一直到省。結算下來，衹有他盟弟胡理處，尚有首尾未清。他盟弟外面雖然大方，心裡極其嗇刻，想錢典史同他算清，面子上又不好露出。因見錢典史有一個翡翠的帶頭子，值得幾文，從前錢典史也說過要賣掉他。胡理到此就心生一計，說有主顧要買，騙到手，估算...",
      "chunk_index": 12,
      "all_reasons": [
        "tail_stratified",
        "high_entity_density"
      ],
      "density_score": 7.2,
      "signal_terms": {
        "chapter_headings": [],
        "quoted_phrases": [],
        "named_places": [
          "護院",
          "衹要府"
        ],
        "titled_people": [],
        "organizations": [
          "護院",
          "衹要府"
        ],
        "artifacts": [],
        "term_highlights": [
          "護院",
          "衹要府"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 2,
          "artifacts": 0,
          "conservative_terms": 2
        }
      }
    }
  ],
  "stats": {
    "chunk_count": 12,
    "sample_count": 12,
    "counts_by_primary_reason": {
      "front_loaded": 2,
      "middle_stratified": 8,
      "tail_stratified": 2
    },
    "counts_by_any_reason": {
      "front_loaded": 2,
      "high_entity_density": 12,
      "middle_stratified": 8,
      "tail_stratified": 2
    },
    "top_density_chunks": [
      {
        "chunk_id": "C0012",
        "chunk_index": 12,
        "density_score": 7.2,
        "term_highlights": [
          "護院",
          "衹要府"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 2,
          "artifacts": 0,
          "conservative_terms": 2
        }
      },
      {
        "chunk_id": "C0006",
        "chunk_index": 6,
        "density_score": 4.0,
        "term_highlights": [
          "可便道來城",
          "一直進城"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 2
        }
      },
      {
        "chunk_id": "C0008",
        "chunk_index": 8,
        "density_score": 4.0,
        "term_highlights": [
          "既不比做州",
          "我這番出山"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 2,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 2
        }
      },
      {
        "chunk_id": "C0011",
        "chunk_index": 11,
        "density_score": 3.8,
        "term_highlights": [
          "指知府"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 1,
          "artifacts": 0,
          "conservative_terms": 1
        }
      },
      {
        "chunk_id": "C0007",
        "chunk_index": 7,
        "density_score": 2.4,
        "term_highlights": [
          "隨時指教"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 1,
          "artifacts": 0,
          "conservative_terms": 1
        }
      },
      {
        "chunk_id": "C0002",
        "chunk_index": 2,
        "density_score": 2.2,
        "term_highlights": [
          "拉翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      },
      {
        "chunk_id": "C0003",
        "chunk_index": 3,
        "density_score": 2.2,
        "term_highlights": [
          "拉翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      },
      {
        "chunk_id": "C0004",
        "chunk_index": 4,
        "density_score": 2.2,
        "term_highlights": [
          "翰林"
        ],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 1,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 1
        }
      },
      {
        "chunk_id": "C0009",
        "chunk_index": 9,
        "density_score": 0.4,
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      },
      {
        "chunk_id": "C0005",
        "chunk_index": 5,
        "density_score": 0.287,
        "term_highlights": [],
        "signal_counts": {
          "chapter_headings": 0,
          "quoted_phrases": 0,
          "named_places": 0,
          "titled_people": 0,
          "organizations": 0,
          "artifacts": 0,
          "conservative_terms": 0
        }
      }
    ]
  },
  "resolved_strategy": {
    "front_count": 2,
    "tail_count": 2,
    "middle_radius": 1,
    "preview_chars": 220
  }
}