Large Language Models

Vuppalapati, Chandrasekar

doi:10.1007/978-3-031-56097-2_3

Chandrasekar Vuppalapati¹⁰

Part of the book series: International Series in Operations Research & Management Science ((ISOR,volume 354))

119 Accesses

Abstract

This chapter presents large language models. It begins with LangChain model, and then it covers GPT-2 model and uses GPT model to classify social sustainability and economic sustainability classifiers from the data. The chapter also covers prompt engineering. Finally, it finishes GPT model on budget speeches as shown in Appendix A–F.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (France)

eBook: EUR 119.83; Price includes VAT (France)

Hardcover Book: EUR 147.69; Price includes VAT (France)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
What is Artificial Intelligence?—https://www.brookings.edu/research/what-is-artificial-intelligence/
2.
What is Artificial Intelligence? An Informed Definition—https://emerj.com/ai-glossary-terms/what-is-artificial-intelligence-an-informed-definition/#:~:text=In%20%E2%80%9CArtificial%20Intelligence%3A%20A%20Modern,many%20different%20splintered%20fields%20%E2%80%93%20speech
3.
Artificial Intelligence (AI)—https://www.gartner.com/en/information-technology/glossary/artificial-intelligence
4.
What is deep learning?—https://www.ibm.com/topics/deep-learning
5.
Model Architecture—https://colab.research.google.com/drive/10wVewWm6n4cdEdDZm3-DUKgVGeTMq9yz#scrollTo=c96103f9
6.
Attention—https://colab.research.google.com/github/harvardnlp/annotated-transformer/blob/master/AnnotatedTransformer.ipynb
7.
Annotated-transformer—https://github.com/harvardnlp/annotated-transformer/
8.
Llama 2—https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
9.
Tensor Flow—https://github.com/tensorflow/tensor2tensor
10.
Attention is all you need—http://nlp.seas.harvard.edu/2018/04/03/attention.html
11.
Llama 2: AI Developers Handbook—https://www.pinecone.io/learn/llama-2/
12.
Request access to the next version of Llama—https://ai.meta.com/resources/models-and-libraries/llama-downloads/
13.
Grade School Math 8K—https://huggingface.co/datasets/gsm8k
14.
Prompt Engineering—https://platform.openai.com/docs/guides/prompt-engineering
15.
To Get Embeddings—https://platform.openai.com/docs/guides/embeddings/what-are-embeddings
16.
What are embeddings?—https://platform.openai.com/docs/guides/embeddings/what-are-embeddings
17.
Obtaining the embeddings—https://platform.openai.com/docs/guides/embeddings/use-cases
18.
Budget 1992–93—https://www.indiabudget.gov.in/doc/bspeech/bs199293.pdf
19.
THE BUDGET FOR THE YEAR 1990–91—https://www.indiabudget.gov.in/doc/bspeech/bs199091.pdf
20.
Recursive Character Text Splitter—https://api.python.langchain.com/en/latest/text_splitter/langchain.text_splitter.RecursiveCharacterTextSplitter.html
21.
langchain.chains.conversational_retrieval.base.ConversationalRetrievalChain—https://api.python.langchain.com/en/latest/chains/langchain.chains.conversational_retrieval.base.ConversationalRetrievalChain.html#

References

Lutz Goedde, Joshua Katz, Alexandre Ménard, and Julien Revellat, “Agriculture’s connected future: How technology can yield new growth”, October 9, 2020, https://www.mckinsey.com/industries/agriculture/our-insights/agricultures-connected-future-how-technology-can-yield-new-growth, Access Date: 09/18/2020
Sofia Bettiza, God and robots: Will AI transform religion?, 21 October 2021, https://www.bbc.com/news/av/technology-58983047, Access Date: June 11, 2023
Dr. Prabhat Kumar, Artificial Intelligence: Resha** Life and Business, Chapter 2, BPB Publications; 1 edition (August 17, 2019), ISBN-10: 9388511077
Google Scholar
Darrell M. West, “What is artificial intelligence?”, Thursday, October 4, 2018, https://www.brookings.edu/research/what-is-artificial-intelligence/, Access Date: November 22, 2020
Daniel Faggella, “What is Artificial Intelligence? An Informed Definition”, December 21, 2018, https://emerj.com/ai-glossary-terms/what-is-artificial-intelligence-an-informed-definition/#:~:text=In%20%E2%80%9CArtificial%20Intelligence%3A%20A%20Modern,many%20different%20splintered%20fields%20%E2%80%93%20speech, Access Date: October 2020
Russell Stuart and Peter Norvig, Artificial Intelligence: A Modern Approach (3rd edition), Publisher: Pearson; 3 edition (December 11, 2009), ISBN-10: 0136042597
Google Scholar
Chandrasekar Vuppalapati (Author), Artificial Intelligence and Heuristics for Enhanced Food Security (International Series in Operations Research & Management Science, 331) 1st ed. 2022 Edition, Publisher: Springer; 1st ed. 2022 edition (September 17, 2022), ISBN-13: 978-3031087424
Google Scholar
Jessica Lin Eamonn Keogh Stefano Lonardi Bill Chiu, A Symbolic Representation of Time Series, with Implications for Streaming Algorithms, June 13, 2003,https://www.cs.ucr.edu/~eamonn/SAX.pdf, Access Date: June 19, 2023
Matt Taddy, Business Data Science: Combining Machine Learning and Economics to Optimize, Automate, and Accelerate Business Decisions 1st Edition, Publisher: McGraw-Hill Education; 1 edition (August 21, 2019), ISBN-10: 1260452778
Google Scholar
Kai Hwang and Min Chen, “Big-Data Analytics for Cloud, IoT and Cognitive Computing”, Publisher: Wiley; 1 edition (August 14, 2017), ISBN-10: 9781119247029
Google Scholar
Rani Horev, BERT Explained: State of the art language model for NLP, Nov 10 2018, https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp-f8b21a9b6270, Access Date: 11/24/2023
Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, May 2019, https://arxiv.org/pdf/1810.04805.pdf, Access Date: 11/24/2023
Fabio Chiusano, Two minutes NLP—GLUE Tasks and 2022 Leaderboard, Feb 28 2022, https://medium.com/nlplanet/two-minutes-nlp-glue-tasks-and-2022-leaderboard-517baedfa597#, Access Date: 11/24/2023
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin, Attention is All you Need, 2017, https://papers.nips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html, Access Date: 11/24/2023
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio, Neural Machine Translation by Jointly Learning to Align and Translate, Submitted on [1 Sep 2014 (v1), last revised 19 May 2016 (this version, v7)], https://arxiv.org/abs/1409.0473, Access Date: 11/24/2023
Alex Graves, Generating Sequences With Recurrent Neural Networks, Submitted on [4 Aug 2013 (v1), last revised 5 Jun 2014 (this version, v5)], https://arxiv.org/abs/1308.0850, Access Date: 11/24/2023
Jay Alammar, The Illustrated Transformer, June 27 2018, https://jalammar.github.io/illustrated-transformer/, Access Date: 11/24/2023
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, **aoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian **ang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom, Llama 2: Open Foundation and Fine-Tuned Chat Models, Submitted on 18 Jul 2023 (v1), last revised 19 Jul 2023 (this version, v2), https://arxiv.org/abs/2307.09288, Access Date: 11/25/2023
Shri Manmohan Singh, Budget 1992-93, PUBLISHED TUE, 29th February 1992, https://www.indiabudget.gov.in/doc/bspeech/bs199293.pdf, Access Date: 11/23/2023
Shri Manmohan Singh, Budget Speech 1991-1992, 24th July 1991,https://www.indiabudget.gov.in/doc/bspeech/bs199192.pdf, Access Date: November 26, 2023.
James Briggs, Llama 2: AI Developers Handbook, 2020, https://www.pinecone.io/learn/llama-2/, Access Date: 11/24/2023
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei **a, Ed Chi, Quoc Le, Denny Zhou, Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, Submitted on 28 Jan 2022 (v1), last revised 10 Jan 2023 (this version, v6)], https://arxiv.org/abs/2201.11903, Access Date: 11/24/2023
Pranab Mukherjee, Budget 2012-2013, March 16, 2012, https://www.indiabudget.gov.in/doc/bspeech/bs201213.pdf, Access Date: December 07, 2023
TUCKER ARRANTS, Text Generation with HuggingFace—GPT2, 2020, https://www.kaggle.com/code/tuckerarrants/text-generation-with-huggingface-gpt2, Access Date: 11/24/2023
PROF. MADHU DANDAVATE, INTRODUCING THE BUDGET FOR THE YEAR 1990-91, PUBLISHED TUE, 1990, https://www.indiabudget.gov.in/doc/bspeech/bs199091.pdf, Access Date: 11/23/2023

Download references

Author information

Authors and Affiliations

Computer Engineering, San Jose State University, San Jose, CA, USA
Chandrasekar Vuppalapati

Authors

Chandrasekar Vuppalapati
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Vuppalapati, C. (2024). Large Language Models. In: Assessing Policy Effectiveness using AI and Language Models. International Series in Operations Research & Management Science, vol 354. Springer, Cham. https://doi.org/10.1007/978-3-031-56097-2_3

Download citation

DOI: https://doi.org/10.1007/978-3-031-56097-2_3
Published: 31 May 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56096-5
Online ISBN: 978-3-031-56097-2
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics