Member-only story
WEEKLY AI NEWS: RESEARCH, NEWS, RESOURCES, AND PERSPECTIVES
AI & ML news: Week 13 — 19 May
15 min readMay 20, 2024
New OpenAI GPT4-o model, Antropic expanding to Europe and much more
The most interesting news, repository, articles, and resources of the week
Check and star this repository where the news will be collected and indexed:
You will find the news first in GitHub. Single posts are also collected here:
Research
- Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models. Separately trained tokenizers are necessary for language models. Tokens that are never encountered during language model training may be produced by these. Even the most potent contemporary language models have a lot. This study investigates this phenomenon and offers solutions for locating and handling these tokens.
- Unlearning in Recommender Systems. With the use of a novel technique called E2URec, huge language model-based recommendation systems may now effectively and efficiently forget user data while maintaining privacy and speed.
- Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. A project called Lumina seeks to provide a single text-to-X generation mechanism. Its training process involves interleaving text, video, audio, and pictures, which enhances downstream performance.
- MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures, and Pressures. In AI, simulators can be very effective tools for gathering…