Use this identifier to cite or link to this record: https://hdl.handle.net/1822/42615

Full record
DC Field | Value | Language
dc.contributor.author | Brito, Rui Miguel Magalhães | por
dc.contributor.author | Almeida, J. J. | por
dc.contributor.author | Simões, Alberto | por
dc.date.accessioned | 2016-09-15T08:44:34Z | -
dc.date.available | 2016-09-15T08:44:34Z | -
dc.date.issued | 2014-11 | -
dc.identifier.citation | Brito, Rui, José João Almeida, e Alberto Simões. 2014. Processing annotated TMX parallel corpora. Em IberSpeech 2014 --- VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop, pp. 188--197, Las Palmas de Gran Canaria, Spain, November, 2014 | por
dc.identifier.isbn | 978-84-617-2862-6 | -
dc.identifier.uri | https://hdl.handle.net/1822/42615 | -
dc.description.abstract | In recent years the amount of freely available multilingual corpora has grown exponentially. Unfortunately, the way these corpora are made available is very diverse, ranging from simple text files or specific XML schemas to supposedly standard formats like the XML Corpus Encoding Initiative, the Text Encoding Initiative, or even the Translation Memory Exchange formats. In this document we defend the usage of Translation Memory Exchange documents, but we enrich their structure in order to support the annotation of the documents with different information like lemmas, multi-words or entities. To support the adoption of the proposed formats, we present a set of tools to manipulate the different formats in an agile way. | por
dc.language.iso | eng | por
dc.relation | info:eu-repo/grantAgreement/FCT/5876/135947/PT | por
dc.rights | openAccess | por
dc.subject | Corpora paralelos | por
dc.subject | TMX | por
dc.subject | PLN | por
dc.subject | Parallel corpora | por
dc.subject | Annotated corpora | por
dc.title | Processing Annotated TMX Parallel Corpora | por
dc.type | conferencePaper | por
dc.peerreviewed | yes | por
dc.relation.publisherversion | http://iberspeech2014.ulpgc.es/images/Iberspeech2014_OnlineProceedings.pdf | por
sdum.publicationstatus | info:eu-repo/semantics/publishedVersion | por
oaire.citationStartPage | 188 | por
oaire.citationEndPage | 197 | por
oaire.citationTitle | IberSpeech 2014 - VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop | por
dc.subject.fos | Ciências Naturais::Ciências da Computação e da Informação | por
sdum.conferencePublication | IberSpeech 2014 - VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop | por
Appears in collections: CEHUM - Artigos em livros de atas

Files in this record:
File | Description | Size | Format
tmxa.pdf | | 373.79 kB | Adobe PDF