Toward Comprehensive Broad-Coverage Cross-Document Event Extraction

Guardat en:
Dades bibliogràfiques
Publicat a:ProQuest Dissertations and Theses (2024)
Autor principal: Vashishtha, Siddharth
Publicat:
ProQuest Dissertations & Theses
Matèries:
Accés en línia:Citation/Abstract
Full Text - PDF
Etiquetes: Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!

MARC

LEADER 00000nab a2200000uu 4500
001 3145382917
003 UK-CbPIL
020 |a 9798346858164 
035 |a 3145382917 
045 2 |b d20240101  |b d20241231 
084 |a 66569  |2 nlm 
100 1 |a Vashishtha, Siddharth 
245 1 |a Toward Comprehensive Broad-Coverage Cross-Document Event Extraction 
260 |b ProQuest Dissertations & Theses  |c 2024 
513 |a Dissertation/Thesis 
520 3 |a Event extraction in natural language processing has traditionally focused on sentence-level analysis, limiting our ability to comprehensively understand complex events across multiple documents. This dissertation introduces a novel approach to comprehensive broad-coverage cross-document event extraction, addressing key limitations in existing methods. The research presents a series of interconnected tasks and datasets that bridge the gap between sentence-level and document-level event understanding, with the ultimate goal of enabling automated extraction of all events from collections of related documents.The work progresses from fundamental tasks of Frame Identification and Source Validation to the introduction of the Frames Across Multiple Sources (FAMuS) corpus, enabling Cross-Document Argument Extraction. Building on these foundations, the dissertation explores methods for extracting multiple event instances within single documents, a crucial step towards fully automated cross-document event extraction. A key innovation is the consistent use of a broad-coverage ontology--FrameNet--as the underlying representation of events. This innovation allows for a wide range of event types to be extracted across diverse domains.This approach, combined with the development of novel annotation methodologies and silver data generation techniques, lays the groundwork for more flexible and scalable event extraction systems. By addressing the challenges of document-level analysis, cross-document information synthesis, and multi-instance event extraction, this research paves the way for comprehensive cross-document event extraction. 
653 |a Computer science 
653 |a Artificial intelligence 
653 |a Linguistics 
773 0 |t ProQuest Dissertations and Theses  |g (2024) 
786 0 |d ProQuest  |t Publicly Available Content Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3145382917/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3145382917/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch