Offline dengan aplikasi Player FM !
[QA] Synthetic continued pretraining
Manage episode 439457773 series 3524393
The paper proposes synthetic continued pretraining using EntiGraph to enhance language models' learning efficiency from small, domain-specific corpora by generating diverse text from salient entities.
https://arxiv.org/abs//2409.07431
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1649 episode
Manage episode 439457773 series 3524393
The paper proposes synthetic continued pretraining using EntiGraph to enhance language models' learning efficiency from small, domain-specific corpora by generating diverse text from salient entities.
https://arxiv.org/abs//2409.07431
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1649 episode
All episodes
×Selamat datang di Player FM!
Player FM memindai web untuk mencari podcast berkualitas tinggi untuk Anda nikmati saat ini. Ini adalah aplikasi podcast terbaik dan bekerja untuk Android, iPhone, dan web. Daftar untuk menyinkronkan langganan di seluruh perangkat.