Computational Processing of the Portuguese Language
12th International Conference, PROPOR 2016, Tomar, Portugal, July 13-15, 2016, Proceedings
Chapter
This chapter provides an analysis of the level of technological preparation of the Portuguese language for the digital age, as well as the actions necessary for the consolidation of Portuguese as a language of...
Chapter and Conference Paper
To advance the neural encoding of Portuguese (PT), and a fortiori the technological preparation of this language for the digital age, we developed a Transformer-based foundation model that sets a new state of ...
Chapter and Conference Paper
We report on the application of a neural network based approach to the problem of automatically categorizing texts according to their proficiency levels and suitability for learners of Portuguese as a second l...
Chapter and Conference Paper
Neural machine translation needs a very large volume of data to unfold its potential. Self-learning with back-translation became widely adopted to address this data scarceness bottleneck: a seed system is used...
Chapter and Conference Paper
This paper addresses the issue of how to obtain processing tools for argument identification for the vast majority of the languages that, differently from English, have little to no relevant labeled data.
Chapter and Conference Paper
The generation of synthetic parallel corpora through the automatic translation of a monolingual text, a process known as back-translation, is a technique used to augment the amount of parallel data available ...
Chapter and Conference Paper
Machine Translation (MT) has been one of the classic AI tasks from the early days of the field. Portuguese and Chinese are languages with a very large number of native speakers, though this does not carry thro...
Chapter and Conference Paper
The central goal of this paper is to report on the results of an experimental study on the application of character-level embeddings and basic convolutional neural network to the shared task of sentence paraph...
Article
Article
Chapter and Conference Paper
Despite its relatively short period of existence as a scientific area, natural language processing has gone through a succession of diverse mainstream research paradigms. How similar are these inflection momen...
Article
This article provides an overview of the dissemination work carried out in META-NET from 2010 until 2015; we describe its impact on the regional, national and international level, mainly with regard to politic...
Book and Conference Proceedings
12th International Conference, PROPOR 2016, Tomar, Portugal, July 13-15, 2016, Proceedings
Chapter and Conference Paper
The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical...
Chapter and Conference Paper
Machine translation (MT) from English to Portuguese has not typically received much attention in existing research. In this paper, we focus on MT from English to Portuguese for the specific domain of informati...
Chapter and Conference Paper
In this article we describe the creation and distribution of the first publicly available word embeddings for Portuguese. Our embeddings are evaluated on their own and also compared with the original English m...
Chapter and Conference Paper
This paper is concerned with a tool that supports human experts in their task of classifying text excerpts suitable to be used in quizzes for learning materials and as items of exams that are aimed at assessin...
Chapter and Conference Paper
Multi-document summarization aims to create a single summary based on the information conveyed by a collection of texts. After the candidate sentences have been identified and ordered, it is time to select whi...
Chapter and Conference Paper
We present a new collection of treebanks for the Portuguese language, comprising five datasets that cover major types of grammatically annotated corpora: TreeBankPT, PropBankPT, DependencyBankPT, LogicalFormBa...
Chapter and Conference Paper
This paper presents a machine learning approach to find and classify discourse relations between two unseen sentences. It describes the process of training a classifier that aims to determine (i) if there is a...