Part of the book series: International Series in Operations Research & Management Science ((ISOR,volume 354))

  • 119 Accesses

Abstract

This chapter presents large language models. It begins with LangChain model, and then it covers GPT-2 model and uses GPT model to classify social sustainability and economic sustainability classifiers from the data. The chapter also covers prompt engineering. Finally, it finishes GPT model on budget speeches as shown in Appendix A–F.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (France)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 119.83
Price includes VAT (France)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
EUR 147.69
Price includes VAT (France)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    What is Artificial Intelligence?—https://www.brookings.edu/research/what-is-artificial-intelligence/

  2. 2.

    What is Artificial Intelligence? An Informed Definition—https://emerj.com/ai-glossary-terms/what-is-artificial-intelligence-an-informed-definition/#:~:text=In%20%E2%80%9CArtificial%20Intelligence%3A%20A%20Modern,many%20different%20splintered%20fields%20%E2%80%93%20speech

  3. 3.

    Artificial Intelligence (AI)—https://www.gartner.com/en/information-technology/glossary/artificial-intelligence

  4. 4.

    What is deep learning?—https://www.ibm.com/topics/deep-learning

  5. 5.

    Model Architecture—https://colab.research.google.com/drive/10wVewWm6n4cdEdDZm3-DUKgVGeTMq9yz#scrollTo=c96103f9

  6. 6.

    Attention—https://colab.research.google.com/github/harvardnlp/annotated-transformer/blob/master/AnnotatedTransformer.ipynb

  7. 7.

    Annotated-transformer—https://github.com/harvardnlp/annotated-transformer/

  8. 8.

    Llama 2—https://huggingface.co/meta-llama/Llama-2-7b-chat-hf

  9. 9.

    Tensor Flow—https://github.com/tensorflow/tensor2tensor

  10. 10.

    Attention is all you need—http://nlp.seas.harvard.edu/2018/04/03/attention.html

  11. 11.

    Llama 2: AI Developers Handbook—https://www.pinecone.io/learn/llama-2/

  12. 12.

    Request access to the next version of Llama—https://ai.meta.com/resources/models-and-libraries/llama-downloads/

  13. 13.

    Grade School Math 8K—https://huggingface.co/datasets/gsm8k

  14. 14.

    Prompt Engineering—https://platform.openai.com/docs/guides/prompt-engineering

  15. 15.

    To Get Embeddings—https://platform.openai.com/docs/guides/embeddings/what-are-embeddings

  16. 16.

    What are embeddings?—https://platform.openai.com/docs/guides/embeddings/what-are-embeddings

  17. 17.

    Obtaining the embeddings—https://platform.openai.com/docs/guides/embeddings/use-cases

  18. 18.

    Budget 1992–93—https://www.indiabudget.gov.in/doc/bspeech/bs199293.pdf

  19. 19.

    THE BUDGET FOR THE YEAR 1990–91—https://www.indiabudget.gov.in/doc/bspeech/bs199091.pdf

  20. 20.

    Recursive Character Text Splitter—https://api.python.langchain.com/en/latest/text_splitter/langchain.text_splitter.RecursiveCharacterTextSplitter.html

  21. 21.

    langchain.chains.conversational_retrieval.base.ConversationalRetrievalChain—https://api.python.langchain.com/en/latest/chains/langchain.chains.conversational_retrieval.base.ConversationalRetrievalChain.html#

References

  1. Lutz Goedde, Joshua Katz, Alexandre Ménard, and Julien Revellat, “Agriculture’s connected future: How technology can yield new growth”, October 9, 2020, https://www.mckinsey.com/industries/agriculture/our-insights/agricultures-connected-future-how-technology-can-yield-new-growth, Access Date: 09/18/2020

  2. Sofia Bettiza, God and robots: Will AI transform religion?, 21 October 2021, https://www.bbc.com/news/av/technology-58983047, Access Date: June 11, 2023

  3. Dr. Prabhat Kumar, Artificial Intelligence: Resha** Life and Business, Chapter 2, BPB Publications; 1 edition (August 17, 2019), ISBN-10: 9388511077

    Google Scholar 

  4. Darrell M. West, “What is artificial intelligence?”, Thursday, October 4, 2018, https://www.brookings.edu/research/what-is-artificial-intelligence/, Access Date: November 22, 2020

  5. Daniel Faggella, “What is Artificial Intelligence? An Informed Definition”, December 21, 2018, https://emerj.com/ai-glossary-terms/what-is-artificial-intelligence-an-informed-definition/#:~:text=In%20%E2%80%9CArtificial%20Intelligence%3A%20A%20Modern,many%20different%20splintered%20fields%20%E2%80%93%20speech, Access Date: October 2020

  6. Russell Stuart and Peter Norvig, Artificial Intelligence: A Modern Approach (3rd edition), Publisher: Pearson; 3 edition (December 11, 2009), ISBN-10: 0136042597

    Google Scholar 

  7. Chandrasekar Vuppalapati (Author), Artificial Intelligence and Heuristics for Enhanced Food Security (International Series in Operations Research & Management Science, 331) 1st ed. 2022 Edition, Publisher: Springer; 1st ed. 2022 edition (September 17, 2022), ISBN-13: 978-3031087424

    Google Scholar 

  8. Jessica Lin Eamonn Keogh Stefano Lonardi Bill Chiu, A Symbolic Representation of Time Series, with Implications for Streaming Algorithms, June 13, 2003,https://www.cs.ucr.edu/~eamonn/SAX.pdf, Access Date: June 19, 2023

  9. Matt Taddy, Business Data Science: Combining Machine Learning and Economics to Optimize, Automate, and Accelerate Business Decisions 1st Edition, Publisher: McGraw-Hill Education; 1 edition (August 21, 2019), ISBN-10: 1260452778

    Google Scholar 

  10. Kai Hwang and Min Chen, “Big-Data Analytics for Cloud, IoT and Cognitive Computing”, Publisher: Wiley; 1 edition (August 14, 2017), ISBN-10: 9781119247029

    Google Scholar 

  11. Rani Horev, BERT Explained: State of the art language model for NLP, Nov 10 2018, https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp-f8b21a9b6270, Access Date: 11/24/2023

  12. Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, May 2019, https://arxiv.org/pdf/1810.04805.pdf, Access Date: 11/24/2023

  13. Fabio Chiusano, Two minutes NLP—GLUE Tasks and 2022 Leaderboard, Feb 28 2022, https://medium.com/nlplanet/two-minutes-nlp-glue-tasks-and-2022-leaderboard-517baedfa597#, Access Date: 11/24/2023

  14. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin, Attention is All you Need, 2017, https://papers.nips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html, Access Date: 11/24/2023

  15. Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio, Neural Machine Translation by Jointly Learning to Align and Translate, Submitted on [1 Sep 2014 (v1), last revised 19 May 2016 (this version, v7)], https://arxiv.org/abs/1409.0473, Access Date: 11/24/2023

  16. Alex Graves, Generating Sequences With Recurrent Neural Networks, Submitted on [4 Aug 2013 (v1), last revised 5 Jun 2014 (this version, v5)], https://arxiv.org/abs/1308.0850, Access Date: 11/24/2023

  17. Jay Alammar, The Illustrated Transformer, June 27 2018, https://jalammar.github.io/illustrated-transformer/, Access Date: 11/24/2023

  18. Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, **aoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian **ang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom, Llama 2: Open Foundation and Fine-Tuned Chat Models, Submitted on 18 Jul 2023 (v1), last revised 19 Jul 2023 (this version, v2), https://arxiv.org/abs/2307.09288, Access Date: 11/25/2023

  19. Shri Manmohan Singh, Budget 1992-93, PUBLISHED TUE, 29th February 1992, https://www.indiabudget.gov.in/doc/bspeech/bs199293.pdf, Access Date: 11/23/2023

  20. Shri Manmohan Singh, Budget Speech 1991-1992, 24th July 1991,https://www.indiabudget.gov.in/doc/bspeech/bs199192.pdf, Access Date: November 26, 2023.

  21. James Briggs, Llama 2: AI Developers Handbook, 2020, https://www.pinecone.io/learn/llama-2/, Access Date: 11/24/2023

  22. Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei **a, Ed Chi, Quoc Le, Denny Zhou, Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, Submitted on 28 Jan 2022 (v1), last revised 10 Jan 2023 (this version, v6)], https://arxiv.org/abs/2201.11903, Access Date: 11/24/2023

  23. Pranab Mukherjee, Budget 2012-2013, March 16, 2012, https://www.indiabudget.gov.in/doc/bspeech/bs201213.pdf, Access Date: December 07, 2023

  24. TUCKER ARRANTS, Text Generation with HuggingFace—GPT2, 2020, https://www.kaggle.com/code/tuckerarrants/text-generation-with-huggingface-gpt2, Access Date: 11/24/2023

  25. PROF. MADHU DANDAVATE, INTRODUCING THE BUDGET FOR THE YEAR 1990-91, PUBLISHED TUE, 1990, https://www.indiabudget.gov.in/doc/bspeech/bs199091.pdf, Access Date: 11/23/2023

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Vuppalapati, C. (2024). Large Language Models. In: Assessing Policy Effectiveness using AI and Language Models. International Series in Operations Research & Management Science, vol 354. Springer, Cham. https://doi.org/10.1007/978-3-031-56097-2_3

Download citation

Publish with us

Policies and ethics

Navigation