Different Issues in the Design of a Lemmatizer/Tagger for Basque

This paper presents relevant issues that have been considered in the design of a general purpose lemmatizer/tagger for Basque (EUSLEM). The lemmatizer/tagger is conceived as a basic tool necessary for other linguistic applications. It uses the lexical data base and the morphological analyzer previou...

Full description

Saved in:
Bibliographic Details
Main Authors: Aduriz, I, Alegria, I, Arriola, J. M, Artola, X, A, Diaz de Illarraza, Ezeiza, N, Gojenola, K, Maritxalar, M
Format: Journal Article
Language:English
Published: 20-03-1995
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents relevant issues that have been considered in the design of a general purpose lemmatizer/tagger for Basque (EUSLEM). The lemmatizer/tagger is conceived as a basic tool necessary for other linguistic applications. It uses the lexical data base and the morphological analyzer previously developed and implemented. Due to the characteristics of the language, the tagset here proposed in structured in for levels, so that each level is a refinement of the previous one in the sense that it adds more detailed information. We will focus on the problems found in designing this tagset and on the strategies for morphological disambiguation that will be used.
DOI:10.48550/arxiv.cmp-lg/9503020