An analysis of Google Translate and DeepL translation of source text typographical errors in the economic and legal fields
Training neural machine translation systems with noisy data has been shown to improve robustness (Heigold et al., 2018). The objective of the present study is to test Google Translate and DeepL performance in the detection and correction of typographical errors, by introducing 1,820 source text typo...
Published in: | Revista de llengua i dret, pp. 88-105 |
---|---|
Main Author: | Santiago Rodríguez-Rubio Mediavilla |
Format: | Journal Article |
Language: | Aragonese Spanish English |
Published: | Escola d'Administració Pública de Catalunya, 01-06-2024 |
Subjects: | |
Abstract | Training neural machine translation systems with noisy data has been shown to improve robustness (Heigold et al., 2018). The objective of the present study is to test Google Translate and DeepL performance in the detection and correction of typographical errors, by introducing 1,820 source text typos found in a previous work on specialised Spanish-English dictionaries (Rodríguez-Rubio & Fernández-Quesada, 2020a, 2020b; Rodríguez-Rubio Mediavilla, 2021). Typos were introduced in isolation and also in co-text. Results showed that Google Translate clearly outperformed DeepL. Moreover, the repetition of the source typo was found to be the most frequent phenomenon in the machine translation output of both systems. By shedding light on the capacity of systems to deal with source text typographical errors, our study aims to provide a starting point for their refinement. |
---|---|
Author | Santiago Rodríguez-Rubio Mediavilla |
Author_xml | – sequence: 1 fullname: Santiago Rodríguez-Rubio Mediavilla |
ContentType | Journal Article |
DBID | DOA |
DOI | 10.58992/rld.i81.2024.4188 |
DatabaseName | Directory of Open Access Journals |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: DOA name: Directory of Open Access Journals url: http://www.doaj.org/ sourceTypes: Open Website |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Languages & Literatures |
EISSN | 2013-1453 |
EndPage | 105 |
ExternalDocumentID | oai_doaj_org_article_073e76fa738448739a8d65b6788744c7 |
GroupedDBID | .4L 1XV 2VB ADBBV ALMA_UNASSIGNED_HOLDINGS BCNDV EBS EJD GROUPED_DOAJ HEY KQ8 OK1 RNS ~Y0 |
ID | FETCH-LOGICAL-d151t-fd97f8cba8f88427d001fd650de04e3c474633f5c8cfd988c5337e63a447334c3 |
IEDL.DBID | DOA |
ISSN | 0212-5056 |
IngestDate | Tue Oct 22 15:16:54 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | true |
Language | Aragonese English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-d151t-fd97f8cba8f88427d001fd650de04e3c474633f5c8cfd988c5337e63a447334c3 |
OpenAccessLink | https://doaj.org/article/073e76fa738448739a8d65b6788744c7 |
PageCount | 18 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_073e76fa738448739a8d65b6788744c7 |
PublicationCentury | 2000 |
PublicationDate | 2024-06-01 |
PublicationDateYYYYMMDD | 2024-06-01 |
PublicationDate_xml | – month: 06 year: 2024 text: 2024-06-01 day: 01 |
PublicationDecade | 2020 |
PublicationTitle | Revista de llengua i dret |
PublicationYear | 2024 |
Publisher | Escola d'Administració Pública de Catalunya |
Publisher_xml | – name: Escola d'Administració Pública de Catalunya |
SSID | ssib026971828 ssj0000498125 |
Score | 2.3141017 |
SourceID | doaj |
SourceType | Open Website |
StartPage | 88 |
SubjectTerms | deepl google translate neural machine translation specialised texts translation typographical errors |
Title | An analysis of Google Translate and DeepL translation of source text typographical errors in the economic and legal fields |
URI | https://doaj.org/article/073e76fa738448739a8d65b6788744c7 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |