  1. No Access

    Chapter

    Corpus as a Secondary Resource for ELT

    In this chapter, we propose utilizing the English Language Corpus (ELC) as a secondary resource for English Language Teaching (ELT) materials [...]. We argue for using ELC as one of the most authentic representative collections of modern language from where we can extr...

    Niladri Sekhar Dash, L. Ramamoorthy in Utility and Application of Language Corpora (2019)

  2. No Access

    Chapter

    Corpus and Technical TermBank

    The development of an exhaustive database of in a natural language carries tremendous importance in the areas of linguistic , , , , , , , , , , as well as in many other domains of and . Keeping ...

    Niladri Sekhar Dash, L. Ramamoorthy in Utility and Application of Language Corpora (2019)

  3. No Access

    Chapter

    Corpus and Dialect Study

    In the present Indian , we find that many minority language communities are living in different sociocultural and geoclimatic regions across the country. Any kind of systematic study on these languages requir...

    Niladri Sekhar Dash, L. Ramamoorthy in Utility and Application of Language Corpora (2019)

  4. No Access

    Chapter

    Corpus and Some Other Domains

    Language is now accepted as one of the primary resources in several branches of application-oriented and -based . In all these branches, is directly and indirectly used for , , and application of vario...

    Niladri Sekhar Dash, L. Ramamoorthy in Utility and Application of Language Corpora (2019)

  5. No Access

    Chapter

    Corpus and Future Indian Needs

    In this chapter, we first try to present a general picture about the present scenario of in the Indian with an appropriate focus on the works already done as well as adequate attention on the works that are in t...

    Niladri Sekhar Dash, L. Ramamoorthy in Utility and Application of Language Corpora (2019)

  6. No Access

    Article

    Word Sense Disambiguation in Bangla Language Using Supervised Methodology with Necessary Modifications

    An attempt is made in this paper to report how a supervised methodology has been adopted for the task of word sense disambiguation in Bangla with necessary modifications. At the initial stage, the Naïve Bayes ...

    Alok Ranjan Pal, Diganta Saha in Journal of The Institution of Engineers (I… (2018)

  7. No Access

    Chapter and Conference Paper

    Application of TF-IDF Feature for Categorizing Documents of Online Bangla Web Text Corpus

    This paper explores the use of standard features as well as machine learning approaches for categorizing Bangla text documents of online Web corpus. The TF-IDF feature with dimensionality reduction technique (...

    Ankita Dhar, Niladri Sekhar Dash, Kaushik Roy in Intelligent Engineering Informatics (2018)

  8. No Access

    Chapter and Conference Paper

    Categorization of Bangla Web Text Documents Based on TF-IDF-ICF Text Analysis Scheme

    With the rapid growth and huge availability of digital text data, automatic text categorization or classification is a comparatively more effective solution in organizing and managing textual information. It i...

    Ankita Dhar, Niladri Sekhar Dash, Kaushik Roy in Social Transformation – Digital Way (2018)

  9. No Access

    Chapter

    Features of a Corpus

    Defining the characteristic features of a corpus, in general, has been an issue of great debate for decades. Due to diversities involved in the types of text used for corpus generation, identification of featu...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  10. No Access

    Chapter

    Pre-digital Corpora (Part 2)

    Following the footsteps of the previous chapter (Chap. 9), in this chapter, we have presented a short description of the process of corpus generation and utilization in ...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  11. No Access

    Chapter

    Nature of Data

    It is always difficult to define the nature of language data since language texts often possess multiple properties, due to which the nature of a particular text may overlap with that of another. However, sinc...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  12. No Access

    Chapter

    Digital Text Corpora (Part 2)

    The generation of text corpora is not confined to a few widely privileged languages such as English, French, German or Spanish. Many lesser-known and under-privileged languages are also emerging with corpora o...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  13. No Access

    Chapter

    Nature of Text Application

    In this chapter, we have sketched out how language corpora can be classified based on the nature of the application of texts at various domains of linguistics and language technology. We have argued that a ‘pa...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  14. No Access

    Chapter

    Utilization of Language Corpora

    Even after nearly 70 years, the staunch supporters of the generative genre still like to argue that linguistics is a branch of intuition and introspection where corpora, as a showcase of empirical language dat...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  15. No Access

    Chapter

    Web Text Corpus

    The World Wide Web is viewed as a useful linguistic resource since it is a unique linguistic world that is full of surprising linguistic data and information. It is the largest store of texts in existence that...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  16. No Access

    Chapter

    Definition of ‘Corpus’

    Understanding the concept of ‘corpus’ has been one of the challenging issues in corpus linguistics in recent times. Language users are often confused with the concept, and as a result of this, they sometimes c...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  17. No Access

    Chapter

    Genre of Text

    Classification of corpus based on genre is a difficult theoretical exercise which is carried out in this chapter. In this chapter, we have first justified why it is necessary to classify corpora based on certa...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  18. No Access

    Chapter

    Digital Text Corpora (Part 1)

    The history of digital text corpus generation and usage presents an interesting narrative. It shows how technology has brought about a resurgence in the discipline of linguistics, which was otherwise turning i...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  19. No Access

    Chapter

    Type and Purpose of Text

    The classification of the corpus is not confined to the genre and nature of texts. It spreads far beyond this. In this chapter, we have tried to show that a corpus can also be classified based on the type of t...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)

  20. No Access

    Chapter

    Digital Speech Corpora

    The history of speech corpus generation is comparatively short, slow and shady in comparison to text corpus generation. In fact, the diversity observed in text corpus generation is hardly noted in speech corpu...

    Niladri Sekhar Dash, S. Arulmozi in History, Features, and Typology of Language Corpora (2018)
