<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>allen institute for ai Archives - Stay Ahead with Heaptalk: Your Go-To Source for Business News</title>
	<atom:link href="https://heaptalk.com/tag/allen-institute-for-ai/feed/" rel="self" type="application/rss+xml" />
	<link>https://heaptalk.com/tag/allen-institute-for-ai/</link>
	<description>Latest business news media headlines platform today &#124; Lets talk Business</description>
	<lastBuildDate>Mon, 09 Oct 2023 03:50:20 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.5</generator>

<image>
	<url>https://heaptalk.com/wp-content/uploads/2025/04/cropped-fav-icon-48x48-1-32x32.png</url>
	<title>allen institute for ai Archives - Stay Ahead with Heaptalk: Your Go-To Source for Business News</title>
	<link>https://heaptalk.com/tag/allen-institute-for-ai/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>AI2 introduces Dolma&#8217;s 3 trillion open dataset to train language models</title>
		<link>https://heaptalk.com/technology/ai2-introduces-dolmas-3-trillion-open-dataset-to-train-language-models/</link>
		
		<dc:creator><![CDATA[Sinta]]></dc:creator>
		<pubDate>Tue, 22 Aug 2023 03:45:42 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[AI2]]></category>
		<category><![CDATA[allen institute for ai]]></category>
		<category><![CDATA[dolma]]></category>
		<category><![CDATA[olmo]]></category>
		<guid isPermaLink="false">https://heaptalk.com/?p=11910</guid>

					<description><![CDATA[<p>Short for ‘Data to Feed OLMo&#8217;s Appetite’, Dolma dataset contains 3 trillion tokens derived from web content, academic publications, code, books, and encyclopedic materials. Heaptalk, Jakarta — Seattle-based non-profit research institute, The Allen Institute for AI (AI2), introduced a massive open dataset Dolma for training language models. The dataset is part of its open language [&#8230;]</p>
<p>The post <a href="https://heaptalk.com/technology/ai2-introduces-dolmas-3-trillion-open-dataset-to-train-language-models/">AI2 introduces Dolma&#8217;s 3 trillion open dataset to train language models</a> appeared first on <a href="https://heaptalk.com">Stay Ahead with Heaptalk: Your Go-To Source for Business News</a>.</p>
]]></description>
		
		
		
			</item>
	</channel>
</rss>
