Skip to content

Commit

Permalink
Update module_5.html
Browse files Browse the repository at this point in the history
  • Loading branch information
soberbichler authored Dec 2, 2024
1 parent 22a120f commit ad3279b
Showing 1 changed file with 14 additions and 15 deletions.
29 changes: 14 additions & 15 deletions modules/module_5.html
Original file line number Diff line number Diff line change
Expand Up @@ -46,27 +46,26 @@
<a href="../index.html" class="btn btn-secondary back-button">&larr; Go Back</a>

<div class="container">
<h1>Module 4: Large Language Models for Article Extraction and Post-OCR Correction</h1>
<h1>Module 5: Large Language Models for Article Extraction and Post-OCR Correction</h1>

<p>Module 3 will be all about Large Language models, prompting techniques and two specific NLP taks: article extraction and OCR post-correction</p>
<p>Module 5 will be all about Large Language models, prompting techniques and two specific NLP taks: article extraction and OCR post-correction</p>
<ul>
Large Language Models (LLMs) are artificial intelligence systems trained on massive text datasets that can process and generate human language based on the Transformer architecture introduced by Vaswear et al. in 2017. These models use neural networks to predict likely next tokens in a sequence, enabling tasks like text completion, translation, and question answering. While research shows correlations between model size, training data, and performance, specific capabilities and limitations continue to be actively studied and debated in the research community. They fundamentally operate through pattern matching rather than genuine understanding.
</ul>

<h3>Preparation for Module 5:</h3>
<ol>
<li>
<p>Read the article listed under literature below and prepare for class discussion:</p>
<ul>
<li>Why are machine learning methods called "Black Boxes"?</li>
<li>What does XAI stand for?</li>
<li>What is a self-attention mechanism?</li>
<li>Name a few methods to look into the "Black Box"</li>
<li>Create at least one more entry in the Glossary</li>
</ul>
</li>
<h3>Preparation for Module 5:</h3>
<ol>
<li>Watch (if not done already) this YouTube Video on LLMs: <a href="https://www.youtube.com/watch?v=LPZh9BOjkQs&list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi&index=5">3Blue1Brown: Large Language Models</a></li>
<li>Inform yourself: What is Prompt Engineering and what kind of prompting techniques can you find?</li>
<li>Create an NVIDIA token:
<ol>
<li>Visit the <a href="https://build.nvidia.com/nvidia/llama-3_1-nemotron-70b-instruct">NVIDIA AI Playground</a></li>
<li>Click on login</li>
<li>Enter your University Email</li>
<li>Copy the token</li>
</ol>

</li>
</ol>
<h3>Literature:</h3>
<p class="citation">
Dobson, J.E. On reading and interpreting black box deep neural networks. Int J Digit Humanities 5, 431–449 (2023). <a href="https://doi.org/10.1007/s42803-023-00075-w" target="_blank">https://doi.org/10.1007/s42803-023-00075-w</a>
Expand Down

0 comments on commit ad3279b

Please sign in to comment.