-
Chapter
Process of Text Corpus Generation
In this , shall argue and try to explain our claims that a language , which is being developed for representing a natural language, should be large in the amount of data, multidimensional in , and maxim...
-
Chapter
Statistical Studies on Language Corpus
In this , we make attempt to discuss in brief about various types of statistical approaches that are normally used for and analyzing a as well as for obtaining data which may be considered statistically...
-
Chapter
Corpus as a Primary Resource for ELT
In this , we argue in favor of as a second language to the with direct of (ELC). Kee** various advantages of ELC in view, we address here some of the issues relating to the application of ELC as a...
-
Chapter
Corpus and Word Sense Disambiguation
Every natural language has a large set of , which, when these are used in a piece of , may vary in sense denotation. It has been noted that for ages that , where these are found to be used, can play an ex...
-
Chapter
Corpus and Dictionary Making
Recent works show that a can be made to a certain level of satisfaction if it is made with data and acquired from widely representative and properly balanced language . A language provides an empirical ...
-
Chapter
Corpus and Machine Translation
History shows that a machine translation (MT) system with the support of a few linguistic rules is not realistic. A few rules are not sufficient for capturing the wide variety a natural language exhibits in it...
-
Chapter
Language Corpora: The Indian Scenario
The humble of this is to refer to some of the achievements in the area of generation and databases compilation, which have been done for a few within last two and half decades. We shall also try to r...
-
Chapter
Issues in Text Corpus Generation
In this , we shall briefly discuss some of the basic issues that are directly linked with corpus generation in digital form with the involvement of computer in the process. The act of asks for considerati...
-
Chapter
Corpus Editing and Text Normalization
this , we propose for applying processes like and as some of the essential components of and for making a ready for access across various domains of and . Here, we identify some of the basic...
-
Chapter
Processing Texts in a Corpus
In this , we shall make attempt to discuss some of the most common techniques that are often used for texts stored in a . From the early stage of and , most of these techniques have been quite useful ...
-
Chapter
Corpus as a Secondary Resource for ELT
In this , we propose for utilizing (ELC) as a for (ELT) materials for to . We argue for using ELC as one of the most authentic representative collections of modern language from where we can extr...
-
Chapter
Corpus and Technical TermBank
The development of an exhaustive database of in a natural language carries tremendous importance in the areas of linguistic , , , , , , , , , , as well as in many other domains of and . Kee** ...
-
Chapter
Corpus and Dialect Study
In the present Indian , we find that many minority language communities are living in different sociocultural and geoclimatic regions across the country. Any kind of systematic study on these languages requir...
-
Chapter
Corpus and Some Other Domains
Language is now accepted as one of the primary resources in several branches of application-oriented and -based . In all these branches, is directly and indirectly used for , , and application of vario...
-
Chapter
Corpus and Future Indian Needs
In this , we first try to present a general picture about the present scenario of in the Indian with an appropriate focus on the works already done as well as adequate attention on the works that are in t...
-
Chapter
Features of a Corpus
Defining the characteristic features of a corpus, in general, has been an issue of great debate for decades. Due to diversities involved in the types of text used for corpus generation, identification of featu...
-
Chapter
Pre-digital Corpora (Part 2)
Following the footsteps of the previous chapter (Chap. 9), in this chapter, we have presented a short description of the process of corpus generation and utilization in ...
-
Chapter
Nature of Data
It is always difficult to define the nature of language data since language texts often possess multiple properties, due to which the nature of a particular text may overlap with that of another. However, sinc...
-
Chapter
Digital Text Corpora (Part 2)
The generation of text corpora is not confined to a few widely privileged languages such as English, French, German or Spanish. Many lesser-known and under-privileged languages are also emerging with corpora o...
-
Chapter
Nature of Text Application
In this chapter, we have sketched out how language corpora can be classified based on the nature of the application of texts at various domains of linguistics and language technology. We have argued that a ‘pa...