Converting a constituency treebank to dependency treebank for Vietnamese

Bibliographic Details
Published in: 2022 RIVF International Conference on Computing and Communication Technologies (RIVF), pp. 256-261
Main Authors: Truong, Chau M., Pham, Tai V., Phan, Minh N., Le, Nhan D. T., Nguyen, Thinh V., Nguyen, Quy T.
Format: Conference Proceeding
Language: English
Published: IEEE 20-12-2022
Description
Summary: Dependency parsing has become the norm because of its advantages in representing syntactic information for numerous natural language processing (NLP) tasks. For Vietnamese, a challenging problem in this domain is the shortage of training resources. Our work presents a new method to automatically convert a Vietnamese constituency treebank into dependency trees. We designed new dependency labels for the Vietnamese treebank and proposed new head-percolation rules and dependency relations. Experimental results with two state-of-the-art parsers, MaltParser and MSTParser, indicated that our treebank yielded scores roughly 13% UAS and 21% LAS higher than previous work.
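Illustration: the summary mentions head-percolation rules without listing them. The Python sketch below shows, under stated assumptions, how such rules can turn a constituency tree into unlabeled dependency arcs in general. The `Node` class, the `HEAD_RULES` table, and the toy phrase labels are hypothetical and are not the rules or label set proposed in the paper; assigning dependency labels is left out.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    label: str                                    # phrase label or POS tag
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None                    # set only on leaves
    index: int = 0                                # 1-based token position (leaves)


# Hypothetical head rules (NOT the paper's): parent label ->
# (search direction over children, child labels in priority order).
HEAD_RULES: Dict[str, Tuple[str, List[str]]] = {
    "S": ("right", ["VP", "NP"]),
    "VP": ("left", ["V", "VP"]),
    "NP": ("left", ["N", "NP"]),
}


def find_head(node: Node) -> Node:
    """Return the leaf that heads `node` by percolating heads upward."""
    if node.word is not None:
        return node
    direction, priorities = HEAD_RULES.get(node.label, ("left", []))
    kids = node.children if direction == "left" else list(reversed(node.children))
    for label in priorities:
        for child in kids:
            if child.label == label:
                return find_head(child)
    # Fallback: first child in the search direction.
    return find_head(kids[0])


def to_dependencies(node: Node, heads: Optional[Dict[int, int]] = None) -> Dict[int, int]:
    """Map each token index to the index of its head (0 marks the root)."""
    if heads is None:
        heads = {find_head(node).index: 0}
    if node.word is not None:
        return heads
    head_leaf = find_head(node)
    for child in node.children:
        child_head = find_head(child)
        # Every non-head child attaches its own head word to the phrase head.
        if child_head is not head_leaf:
            heads[child_head.index] = head_leaf.index
        to_dependencies(child, heads)
    return heads


# Toy tree for "Tôi đọc sách" ("I read books"), with simplified labels.
tree = Node("S", [
    Node("NP", [Node("N", word="Tôi", index=1)]),
    Node("VP", [
        Node("V", word="đọc", index=2),
        Node("NP", [Node("N", word="sách", index=3)]),
    ]),
])

print(to_dependencies(tree))
```

In this toy example the verb (token 2) becomes the root and both nouns attach to it. A full converter would also need a rule set covering every phrase label in the treebank and a mapping from phrase configurations to dependency labels.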
DOI: 10.1109/RIVF55975.2022.10013806