Please use this identifier to cite or link to this item: http://hdl.handle.net/1822/599

Title: Grabbing parallel corpora from the web
Author(s): Almeida, J. J.; Simões, Alberto; Castro, José Alves de
Keywords: Parallel corpora; Web-mining
Issue date: 2002
Citation: “Sociedade Española para el Procesamiento del Lenguaje Natural” 29 (2002), 13-20.
Series/Report no.: 29
Abstract(s): Multilingual resources are useful for linguistic studies, translation, and many other tasks. Unfortunately, these resources are difficult to obtain and organize. In this document we describe a set of tools designed to help with mining bilingual resources from the web, from a specific site, from a file system, from a list of URLs, or from a translation memory. As a design goal, we intend to build tools that can be used both cooperatively (in a pipeline) and independently.
Type: Article
URI: http://hdl.handle.net/1822/599
Peer-Reviewed: yes
Access: Open access
Appears in Collections: DI/CCTC - Artigos (papers)

Files in This Item:
File: parguess.sepln.pdf
Size: 215.33 kB
Format: Adobe PDF
