郷土に残存する江戸期古記録の機械可読化を目的とした市民参加および機械学習による固有表現抽出

吉賀, 夏子; 堀, 良彰; 只木, 進一; 永崎, 研宣; 伊藤, 昭弘; Natsuko, Yoshiga; Yoshiaki, Hori; Shin-ichi, Tadaki; Kiyonori, Nagasaki; Akihiro, Ito

WEKO3

インデックスツリー

RootNode

アイテム

郷土に残存する江戸期古記録の機械可読化を目的とした市民参加および機械学習による固有表現抽出

https://doi.org/10.20729/00216238

名前 / ファイル	ライセンス	アクション
IPSJ-JNL6302008.pdf (2.5 MB)	Copyright (c) 2022 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2022-02-15

タイトル

郷土に残存する江戸期古記録の機械可読化を目的とした市民参加および機械学習による固有表現抽出

タイトル

言語

タイトル

Named Entities Extraction by Citizen Participation and Machine Learning for Making Machine-readable Old Records of the Edo Period Remaining in Local Communities

言語

jpn

キーワード

主題Scheme

Other

主題

[特集:人文科学とコンピュータ] 江戸期古記録，シチズンサイエンス，ディープラーニング，固有表現抽出，単語分散表現

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

ID登録

10.20729/00216238

ID登録タイプ

JaLC

著者所属

佐賀大学地域学歴史文化研究センター

著者所属

佐賀大学総合情報基盤センター

著者所属

佐賀大学理工学部

著者所属

一般財団法人人文情報学研究所

著者所属

佐賀大学地域学歴史文化研究センター

著者所属(英)

The Center for Regional History and Culture, Saga University

著者所属(英)

Computer and Network Center, Saga University

著者所属(英)

Department of Science and Engineering, Saga University

著者所属(英)

International Institute for Digital Humanities

著者所属(英)

The Center for Regional History and Culture, Saga University

著者名

吉賀, 夏子
堀, 良彰
只木, 進一
永崎, 研宣
伊藤, 昭弘

著者名(英)

Natsuko, Yoshiga
Yoshiaki, Hori
Shin-ichi, Tadaki
Kiyonori, Nagasaki
Akihiro, Ito

論文抄録

内容記述タイプ

Other

内容記述

わが国には，江戸時代以前に記された業務記録や証文などの古記録が数多く存在する．これらを有効に活用するためには，少ない工数で機械可読データを構築する必要がある．特に，地域特有の資料の場合には，地域特有の固有表現への対応が必要となる．本研究では，江戸期の業務日誌である「小城藩日記データベース」の目録記事文からLinked Dataなどの機械可読データを生成することを具体的目標とし，固有表現抽出の効率化を行う．その第1の手法は，市民参加による人手そのものの有効活用である．第2の手法は，機械学習による固有表現の自動抽出である．これらの手法を組み合わせることで，通常は収集の難しい地域特有の固有表現を記事文から，自動かつ高精度で抽出可能である．

論文抄録(英)

内容記述タイプ

Other

内容記述

There are many ancient documents such as business records and testimonials written before the Edo period in Japan. Machine-readable metadata will be one of effective tools for utilizing those records. In cases of materials related to a very small area, in particular, it is necessary to deal with unique expressions restricted in the area. In this study, we set a specific goal to generate machine-readable metadata such as Linked Data from the database of the cataloged articles for the Ogi-han Nikki (business records) from the Edo period. We aim to improve the efficiency in extraction processes of named entities. For this purpose, we employ two methods. The first is effective use of human resources through citizen participation. The second is automated extraction of named entities by machine learning. We show that the proposed method works well even for materials related to a local area.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 63, 号 2, p. 310-323, 発行日 2022-02-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 15:48:52.431300

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

郷土に残存する江戸期古記録の機械可読化を目的とした市民参加および機械学習による固有表現抽出

× 吉賀, 夏子

× 堀, 良彰

× 只木, 進一

× 永崎, 研宣

× 伊藤, 昭弘

× Natsuko, Yoshiga

× Yoshiaki, Hori

× Shin-ichi, Tadaki

× Kiyonori, Nagasaki

× Akihiro, Ito

Versions

Share

Cite as

エクスポート