Abstract Wikipedia/Related and previous work/Natural language generation/th

This page is a translated version of the page Abstract Wikipedia/Related and previous work/Natural language generation and the translation is 24% complete.

Abstract Wikipedia will generate natural language text from an abstract representation. This is not a novel idea, and it has been tried a number of times before.

This page aims to collect different existing approaches. It tries to summarize the core ideas of the different approaches, their advantages and disadvantages, and points to existing implementations. This page (by and for the community) will help to choose which approach to focus on first.

Implementations

Arria NLG

วิกิพีเดีย: Arria NLG [ de ] [ en ] [ nn ]
เว็บไซต์: https://www.arria.com/
สัญญาอนุญาต: Proprietary, 30 patents apply
ภาษาที่สนับสนุน: English

ASTROGEN

เว็บไซต์: http://www.dsv.su.se/~hercules/ASTROGEN/ASTROGEN.html

Chimera

เว็บไซต์: https://github.com/AmitMY/chimera
สัญญาอนุญาต: MIT License

Elvex

เว็บไซต์: https://github.com/lionelclement/Elvex

FUF/SURGE

เว็บไซต์: https://www.cs.bgu.ac.il/~elhadad/surge

Genl

เว็บไซต์: http://kowey.github.io/GenI/

GoPhi

เว็บไซต์: https://github.com/rali-udem/gophi

Grammar Explorer

เว็บไซต์: http://www.fb10.uni-bremen.de/anglistik/langpro/kpml/tutorials/Grexplorer/grexplorer.html

Grammatical Framework

วิกิพีเดีย: Grammatical Framework [ en ] [ nn ]
เว็บไซต์: https://www.grammaticalframework.org/
สัญญาอนุญาต: GNU General Public License: see text
ภาษาที่สนับสนุน: Afrikaans, Amharic (partial), Arabic (partial), Basque (partial), Bulgarian, Catalan, Chinese, Czech (partial), Danish, Dutch, English, Estonian, Finnish, French, German, Greek ancient (partial), Greek modern, Hebrew (fragments), Hindi, Hungarian (partial), Interlingua, Italian, Japanese, Korean (partial), Latin (partial), Latvian, Maltese, Mongolian, Nepali, Norwegian bokmål, Norwegian nynorsk, Persian, Polish, Punjabi, Romanian, Russian, Sindhi, Slovak (partial), Slovene (partial), Somali (partial), Spanish, Swahili (fragments), Swedish, Thai, Turkish (fragments), and Urdu.

jsRealB

เว็บไซต์: http://rali.iro.umontreal.ca/rali/?q=en/jsrealb-bilingual-text-realiser

KPML

เว็บไซต์: http://www.fb10.uni-bremen.de/anglistik/langpro/kpml/README.html
ภาษาที่สนับสนุน: (2014):
- More advanced: Czech, English, German?, Spanish
- Prototype: Bulgarian, Chinese, Dutch, Portuguese, Russian
- Less advanced: French, Greek, Japanese

Linguistic Knowledge Builder

เว็บไซต์: http://moin.delph-in.net/LkbTop

Multimodal Unification Grammar

เว็บไซต์: https://david-reitter.nfshost.com/compling/mug/index.html

NaturalOWL

NLGen and NLGen2

เว็บไซต์: https://launchpad.net/nlgen
https://launchpad.net/nlgen2

OpenCCG

เว็บไซต์: http://openccg.sourceforge.net/

rLDCP

เว็บไซต์: https://cran.r-project.org/web/packages/rLDCP/index.html

RoseaNLG

เว็บไซต์: https://rosaenlg.org/
ภาษาที่สนับสนุน: English, French, German and Italian

Semantic Web Authoring Tool (SWAT)

วิกิพีเดีย: WYSIWYM [ en ] [ nn ] A SWAT is a tool that implements the WYSIWYM (what you see is what you meant) interaction technique for developing formal representations based on successive refinements (by humans) of NLG outputs.
เว็บไซต์: http://mcs.open.ac.uk/nlg/SWAT/
ภาษาที่สนับสนุน: OWL Simplified English

SimpleNLG

เว็บไซต์: https://github.com/simplenlg/simplenlg
ภาษาที่สนับสนุน: English, French

SPUD

เว็บไซต์: https://www.cs.rutgers.edu/~mdstone/nlg.html

Suregen-2

เว็บไซต์: http://www.suregen.de/index.html
ภาษาที่สนับสนุน: German, English

Syntax Maker

เว็บไซต์: https://github.com/mikahama/syntaxmaker
ภาษาที่สนับสนุน: Finnish

TGen

เว็บไซต์: https://github.com/UFAL-DSG/tgen

Universal Networking Language

วิกิพีเดีย: Universal Networking Language [ de ] [ en ] [ es ] [ fr ] [ 日本語 ] [ nn ]

UralicNLP

เว็บไซต์: https://uralicnlp.com/
https://github.com/mikahama/uralicNLP
ภาษาที่สนับสนุน: Finnish, Russian, German, English, Norwegian, Swedish, Arabic, Ingrian, Meadow & Eastern Mari, Votic, Olonets-Karelian, Erzya, Moksha, Hill Mari, Udmurt, Tundra Nenets, Komi-Permyak, North Sami, South Sami and Skolt Sami^[1]

Theoretical background

Please note that the six topics listed above have articles only in the English Wikipedia (24 July 2020).

Natural language generation [ de ] [ en ] [ es ] [ fr ] [ 日本語 ] [ nn ] [ 中文 ] is a sub-field of natural language processing. See the broader topic on Scholia.^[2]

Pipeline model

In their 2018 Survey,^[3] Gatt^[4] and Krahmer^[5] begin by describing natural language generation as the "task of generating text or speech from non-linguistic input." They identify six sub-problems (after Reiter & Dale 1997, 2000^[6]) [2.NLG Tasks, pp. 70-82]:^[3]

These six sub-problems can be seen as a segmentation of the “pipeline”, beginning with “early” tasks, aligned to the purpose of the linguistic output. The “late” tasks are more aligned to the final linguistic form. A summary form might be “What (1), ordered (2) and segmented (3) how, with which words (4&5), in which forms (6)”. Lexicalisation (4) is not clearly distinguished from “referring expression generation” (REG) (5) in this summary form. The key idea during REG is avoiding repetition and ambiguity, or managing the tension between those conflicting aims. This corresponds to the Gricean maxim (Grice, 1975^[7]) that “speakers should make sure that their contributions are sufficiently informative for the purposes of the exchange, but not more so” (or, as Roger Sessions said (1950) after Albert Einstein (1933): “everything should be as simple as it can be but not simpler!”).

Content determination

Document structuring

Aggregation

Lexical choice

Referring expression generation

Realization

“In linguistics, realization is the process by which some kind of surface representation is derived from its underlying representation; that is, the way in which some abstract object of linguistic analysis comes to be produced in actual language. Phonemes are often said to be realized by speech sounds. The different sounds that can realize a particular phoneme are called its allophones.”

“Realization is also a subtask of natural language generation, which involves creating an actual text in a human language (English, French, etc.) from a syntactic representation.”

วิกิพีเดียภาษาอังกฤษ

(Wikipedia contributors, “Realization”, Wikipedia, The Free Encyclopedia, 26 May 2020, 02:46 UTC, <https://en.wikipedia.org/w/index.php?title=Realization&oldid=958866516> [accessed 31 August 2020].)

Black-box approach

In a later survey, Gârbacea and Mei^[8] suggested “Neural language generation” as an emerging sub-field of NLG. Eleven of the papers cited in their survey have titles with “neural language” in them, the earliest from 2016 (Édouard Grave, Armand Joulin, and Nicolas Usunier)^[9]. The earliest citation in which “neural language generation” appears is from 2017 (Jessica Ficler and Yoav Goldberg)^[10].

In mid 2020, “neural language generation” is not mature enough to be used to generate natural language renditions of language-neutral content.

อ้างอิง

Jessica Ficler and Yoav Goldberg, 2017^[10]
Édouard Grave, Armand Joulin, and Nicolas Usunier, 2016^[9]
Gârbacea and Mei, 2020^[8]
Gardent et al., 2017^[11]
Gatt & Krahmer, 2018^[3]
Grice, 1975^[7]
Reiter & Dale, 2000^[6] (PDF ends at the end of the first section.)

แหล่งข้อมูลอื่น

หมายเหตุ

↑ https://models.uralicnlp.com/nightly/
↑ The Scholia view on Natural-language generation lacked the standard sources and leading authors on 27 July 2020. Instead, see Google Scholar.
↑ ^a ^b ^c Gatt, Albert; Krahmer, Emiel (January 2018), "Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation", Journal of Artificial Intelligence Research 61: 65–170, archived from the original on 2020-06-23, retrieved 2020-07-24
↑ Gatt's publications
↑ Emiel Krahmer (Q51689943) selected publications
↑ ^a ^b Reiter, EB; Dale, R (2000), Building Natural-Language Generation Systems. (PDF), Cambridge University Press., archived from the original (PDF) on 2019-07-11, retrieved 2020-07-27
↑ ^a ^b Grice, H. Paul (1975), Logic and conversation (PDF), retrieved 2020-08-10
↑ ^a ^b Gârbacea, Cristina; Mei, Qiaozhu, Neural Language Generation: Formulation, Methods, and Evaluation (PDF), pp. 1–70, retrieved 2020-08-08, Compared to the survey of (Gatt and Krahmer, 2018), our overview is a more comprehensive and updated coverage of neural network methods and evaluation centered around the novel problem definitions and task formulations.
↑ ^a ^b Grave, Édouard; Joulin, Armand; Usunier, Nicolas (2016), Improving neural language models with a continuous cache (PDF)
↑ ^a ^b Ficler, Jessica; Goldberg, Yoav (2017), "Controlling linguistic style aspects in neural language generation" (PDF), Proceedings of the Workshop on Stylistic Variation: 94–104 . Published slightly earlier that year was Van-Khanh Tran and Le-Minh Nguyen. 2017.
Ficler, Jessica; Goldberg, Yoav (2017), Semantic Refinement GRU-based Neural Language Generation for Spoken Dialogue Systems (PDF)
↑ Gardent, Claire; Shimorina, Anastasia; Narayan, Shashi; Perez-Beltrachini, Laura (2017), "The WebNLG Challenge: Generating Text from RDF data." (PDF), Proceedings of the 10th International Conference on Natural Language Generation: 124–133

[1] ttps://models.uralicnlp.com/nightly/

[ScoliaNLG-2] The Scholia view on Natural-language generation lacked the standard sources and leading authors on 27 July 2020. Instead, see Google Scholar.

[Gatt-3] Gatt, Albert; Krahmer, Emiel (January 2018), "Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation", Journal of Artificial Intelligence Research 61: 65–170, archived from the original on 2020-06-23, retrieved 2020-07-24

[4] Gatt's publications

[5] Emiel Krahmer (Q51689943) selected publications

[Reiter-6] Reiter, EB; Dale, R (2000), Building Natural-Language Generation Systems. (PDF), Cambridge University Press., archived from the original (PDF) on 2019-07-11, retrieved 2020-07-27

[Grice-7] Grice, H. Paul (1975), Logic and conversation (PDF), retrieved 2020-08-10

[Gârbacea-8] Gârbacea, Cristina; Mei, Qiaozhu, Neural Language Generation: Formulation, Methods, and Evaluation (PDF), pp. 1–70, retrieved 2020-08-08, Compared to the survey of (Gatt and Krahmer, 2018), our overview is a more comprehensive and updated coverage of neural network methods and evaluation centered around the novel problem definitions and task formulations.

[Grave-9] Grave, Édouard; Joulin, Armand; Usunier, Nicolas (2016), Improving neural language models with a continuous cache (PDF)

[Ficler-10] Ficler, Jessica; Goldberg, Yoav (2017), "Controlling linguistic style aspects in neural language generation" (PDF), Proceedings of the Workshop on Stylistic Variation: 94–104 . Published slightly earlier that year was Van-Khanh Tran and Le-Minh Nguyen. 2017.
Ficler, Jessica; Goldberg, Yoav (2017), Semantic Refinement GRU-based Neural Language Generation for Spoken Dialogue Systems (PDF)

[Gardent-11] Gardent, Claire; Shimorina, Anastasia; Narayan, Shashi; Perez-Beltrachini, Laura (2017), "The WebNLG Challenge: Generating Text from RDF data." (PDF), Proceedings of the 10th International Conference on Natural Language Generation: 124–133

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]