T5 and large language models: The good, the bad, and the ugly

Disclaimer

Do large language models memorize their training data?

Links

T5 Source Code
mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Extracting Training Data from Large Language Models
Do Transformer Modifications Transfer Across Implementations and Applications?