From ad3279b1dbf791dbee0ce2169a60aee282d93726 Mon Sep 17 00:00:00 2001
From: Sarah Oberbichler <66369271+soberbichler@users.noreply.github.com>
Date: Mon, 2 Dec 2024 03:47:10 +0100
Subject: [PATCH] Update module_5.html
---
modules/module_5.html | 29 ++++++++++++++---------------
1 file changed, 14 insertions(+), 15 deletions(-)
diff --git a/modules/module_5.html b/modules/module_5.html
index acbe8b4..ccc8eaf 100644
--- a/modules/module_5.html
+++ b/modules/module_5.html
@@ -46,27 +46,26 @@
← Go Back
-
Module 4: Large Language Models for Article Extraction and Post-OCR Correction
+
Module 5: Large Language Models for Article Extraction and Post-OCR Correction
-
Module 3 will be all about Large Language models, prompting techniques and two specific NLP taks: article extraction and OCR post-correction
+
Module 5 will be all about Large Language models, prompting techniques and two specific NLP taks: article extraction and OCR post-correction
Large Language Models (LLMs) are artificial intelligence systems trained on massive text datasets that can process and generate human language based on the Transformer architecture introduced by Vaswear et al. in 2017. These models use neural networks to predict likely next tokens in a sequence, enabling tasks like text completion, translation, and question answering. While research shows correlations between model size, training data, and performance, specific capabilities and limitations continue to be actively studied and debated in the research community. They fundamentally operate through pattern matching rather than genuine understanding.
-
Preparation for Module 5:
-
- -
-
Read the article listed under literature below and prepare for class discussion:
-
- - Why are machine learning methods called "Black Boxes"?
- - What does XAI stand for?
- - What is a self-attention mechanism?
- - Name a few methods to look into the "Black Box"
- - Create at least one more entry in the Glossary
-
-
+ Preparation for Module 5:
+
+ - Watch (if not done already) this YouTube Video on LLMs: 3Blue1Brown: Large Language Models
+ - Inform yourself: What is Prompt Engineering and what kind of prompting techniques can you find?
+ - Create an NVIDIA token:
+
+ - Visit the NVIDIA AI Playground
+ - Click on login
+ - Enter your University Email
+ - Copy the token
-
+
+
Literature:
Dobson, J.E. On reading and interpreting black box deep neural networks. Int J Digit Humanities 5, 431–449 (2023). https://doi.org/10.1007/s42803-023-00075-w