Netlang: A software for the linguistic analysis of corpora by means of complex networks

Bibliographic Details
Published in: PLoS One vol. 12, no. 8 (Aug 2017), p. e0181341
Main Author: Barceló-Coblijn, Lluís
Other Authors: Diego Serna Salazar, Isaza, Gustavo, Castillo Ossa, Luis F, Bedia, Manuel G
Published:
Public Library of Science
Subjects:
Online Access: Citation/Abstract
Full Text
Full Text - PDF

MARC

LEADER 00000nab a2200000uu 4500
001 1931684588
003 UK-CbPIL
022 |a 1932-6203 
024 7 |a 10.1371/journal.pone.0181341  |2 doi 
035 |a 1931684588 
045 2 |b d20170801  |b d20170831 
084 |a 174835  |2 nlm 
100 1 |a Barceló-Coblijn, Lluís 
245 1 |a Netlang: A software for the linguistic analysis of corpora by means of complex networks 
260 |b Public Library of Science  |c Aug 2017 
513 |a Journal Article 
520 3 |a To date there is no software that directly connects the linguistic analysis of a conversation to a network program. Network programs are able to extract statistical information from databases containing information about systems of interacting elements. Language has also been conceived and studied as a complex system. However, most proposals do not analyze language according to linguistic theory; instead they use computational systems that save time at the price of leaving aside many aspects crucial to linguistic theory. Some approaches to network studies of language do apply precise linguistic analyses made by a linguist. The problem until now has been the lack of an interface between the analysis of a sentence and its integration into the network, one that could be managed by a linguist and could store the analysis of any language. Previous works have used old software that was not created for these purposes and that often had problems with some idiosyncrasies of the target language. The desired interface should be able to deal with the syntactic peculiarities of a particular language, the options of linguistic theory preferred by the user, and the preservation of morpho-syntactic information (lexical categories and syntactic relations between items). Netlang is the first program able to do that. Recently, a new kind of linguistic analysis has been developed that extracts a complexity pattern from the speaker's linguistic production. The pattern is depicted as a network in which words are nodes, and these nodes are connected to each other by edges or links (the information carried by an edge can be syntactic, semantic, etc.). The Netlang software has become the bridge between raw linguistic data and the network program. Netlang has integrated and improved the functions of programs used in the past, namely the DGA annotator and two scripts (ToXML.pl and Xml2Pairs.py) used for transforming and pruning data. 
Netlang allows the researcher to make accurate linguistic analyses by means of syntactic dependency relations between words, while keeping track of the nature of such syntactic relationships (subject, object, etc.). The Netlang software is presented as a new tool that solves many problems detected in the past. The most important improvement is that Netlang integrates three past applications into one program and is able to produce a series of file formats that can be read by a network program. Through the Netlang software, linguistic network analysis based on syntactic analyses, characterized by its low cost and completely non-invasive procedure, aims to evolve into a sufficiently fine-grained tool for clinical diagnosis in potential cases of language disorders. 
651 4 |a Colombia 
653 |a Scripts 
653 |a Accuracy 
653 |a Theory 
653 |a Language acquisition 
653 |a Computer science 
653 |a Syntax 
653 |a Applications programs 
653 |a Language 
653 |a Software 
653 |a Information systems 
653 |a Speeches 
653 |a Computer applications 
653 |a Network analysis 
653 |a Preservation 
653 |a Handbooks 
653 |a Grammar 
653 |a Computer programs 
653 |a Low cost 
653 |a Cost analysis 
653 |a Hypotheses 
653 |a Pruning 
653 |a Nodes 
653 |a Linguistics 
653 |a Information processing 
653 |a Conversation 
653 |a Complexity 
653 |a Bilingualism 
653 |a Integration 
653 |a Language disorders 
653 |a Syntactic analysis 
653 |a Natural language generation 
653 |a Semantics 
653 |a Grammatical relations 
653 |a Corpus analysis 
653 |a Linguists 
653 |a Theoretical linguistics 
653 |a Analysis 
653 |a Medical diagnosis 
653 |a Data 
653 |a Tracking 
653 |a Information 
653 |a Networks 
653 |a Disorders 
653 |a Lexical categories 
653 |a Dependency 
653 |a Social 
700 1 |a Diego Serna Salazar 
700 1 |a Isaza, Gustavo 
700 1 |a Castillo Ossa, Luis F 
700 1 |a Bedia, Manuel G 
773 0 |t PLoS One  |g vol. 12, no. 8 (Aug 2017), p. e0181341 
786 0 |d ProQuest  |t Health & Medical Collection 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/1931684588/abstract/embedded/H09TXR3UUZB2ISDL?source=fedsrch 
856 4 0 |3 Full Text  |u https://www.proquest.com/docview/1931684588/fulltext/embedded/H09TXR3UUZB2ISDL?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/1931684588/fulltextPDF/embedded/H09TXR3UUZB2ISDL?source=fedsrch
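
Illustrative note: the abstract describes a network in which words are nodes and edges carry the syntactic relation between them, exported in a form a network program can read. The following is a minimal Python sketch of that idea only. It is not Netlang's code or file format; the sample sentences, the relation labels, the function names `build_network` and `to_edge_list`, and the tab-separated output layout are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical dependency analyses, one list per sentence.
# Each triple is (head word, dependent word, syntactic relation).
analyses = [
    [("eats", "dog", "subject"), ("eats", "bone", "object"),
     ("dog", "the", "determiner"), ("bone", "a", "determiner")],
    [("sleeps", "dog", "subject"), ("dog", "the", "determiner")],
]


def build_network(sentences):
    """Merge per-sentence dependency triples into one word network.

    Returns a mapping (head, dependent) -> {relation: count}, so the
    nature of each syntactic relation is kept alongside the edge.
    """
    edges = defaultdict(lambda: defaultdict(int))
    for sentence in sentences:
        for head, dependent, relation in sentence:
            edges[(head, dependent)][relation] += 1
    return edges


def to_edge_list(edges):
    """Serialize the network as tab-separated lines (assumed layout):
    head, dependent, relation, count."""
    lines = []
    for (head, dependent), relations in sorted(edges.items()):
        for relation, count in sorted(relations.items()):
            lines.append(f"{head}\t{dependent}\t{relation}\t{count}")
    return lines


network = build_network(analyses)
edge_list = to_edge_list(network)
print("\n".join(edge_list))
```

Keeping the relation label on each edge, rather than reducing the data to bare word pairs, reflects the abstract's point that morpho-syntactic information should be preserved when the analysis is handed to the network program.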