<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Kenneths Blob &#187; Computer Architecture</title>
	<atom:link href="http://blog.langly.org/tag/computer-architecture/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.langly.org</link>
	<description>My rants about everything</description>
	<lastBuildDate>Wed, 28 Apr 2010 18:10:10 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>On GPU versus CPU cores.</title>
		<link>http://blog.langly.org/2009/11/17/gpu-vs-cpu-cores/</link>
		<comments>http://blog.langly.org/2009/11/17/gpu-vs-cpu-cores/#comments</comments>
		<pubDate>Tue, 17 Nov 2009 08:33:00 +0000</pubDate>
		<dc:creator>kenneo</dc:creator>
				<category><![CDATA[Computers]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Computer Architecture]]></category>

		<guid isPermaLink="false">http://blog.langly.org/?p=485</guid>
		<description><![CDATA[In the confusion surrounding the amount of cores on a GPU contra the number of cores I want to contribute with my part, and do some clearing up.
Since the modern day GPUs with support for general purpose calculations, it has been pushed that they contain several hundreds of &#8220;Cores&#8221;, a magnitude higher than the amount ]]></description>
			<content:encoded><![CDATA[<p>In the confusion surrounding the amount of cores on a GPU contra the number of cores I want to contribute with my part, and do some clearing up.</p>
<p>Since the modern day GPUs with support for general purpose calculations, it has been pushed that they contain several hundreds of &#8220;Cores&#8221;, a magnitude higher than the amount of &#8220;Cores&#8221; you can find in a regular CPU, which these days are about 2-4 depending on your version of CPU. Now, this is due to marketing only, and it has bothered me for a while seeing how academicians and computer engineers have started to pick up the term core, using it relentlessly. </p>
<p>Now, the problem arises due to two major facts. First, a the comparison between what the general CPU manufacturers calls a &#8220;Core&#8221; (Intel, AMD etc.), and that what GPU manufacturers ( Nvidia, AMD/ATI ) calls a core, is in fact two similar, but different things. Typically, the modern GPU normally consists of several &#8220;Cores&#8221;, as seen in the following illustration of the new Fermi architecture:</p>
<p><img src="http://techreport.com/r.x/nvidia-fermi/cuda-core.gif" alt="Fermi ARchitecture" /></p>
<p>What you can see in this photo is that the GPU core is a scaled down version of what the CPU manufacturers would call ALU, or a functional unit. Now, compare this to the microarchitecture of a Core2 chip, and check out the yellow boxes. If you want to compare the amount of real cores, these are the ones you have to look at. Furthermore, it&#8217;s important to remember that the Core2 even have a vector unit, which can multiply / add several operands at once.</p>
<p><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/6/60/Intel_Core2_arch.svg/518px-Intel_Core2_arch.svg.png" width="400" alt="Core2" /></p>
<p>Thus, what might be a more fair comparison is the number of multiprocessors in the GPU contra the number of cores on a CPU. In the newest Fermi architecture this is 16, contra the 4 cores on a quad core processor. </p>
<p>However, this is still an unfair comparison. The reason why is due to the type of applications the different architectures are optimized for. Needless to say, the GPU is optimized for graphics processing and stream processing, which in turn is just to churn out data with fairly regular behaviour and memory accesses. Thus, the complexity of of the GPU has been scaled down compared to that of an CPU which has to perform better on a much wider range of applications. Hence, what happens is that the CPU has to use a lot more resources / gates on control structures, leaving the control to calculation gate ratio much higher than found in a GPU. This again, leads to the huge differences between the number of &#8220;cores&#8221; between the CPU and GPU. </p>
<p>As a sidenote, there are still a lot of applications where the CPU outperforms the GPU <img src='http://blog.langly.org/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.langly.org/2009/11/17/gpu-vs-cpu-cores/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fermi Architecture</title>
		<link>http://blog.langly.org/2009/10/01/fermi-architecture/</link>
		<comments>http://blog.langly.org/2009/10/01/fermi-architecture/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 08:02:14 +0000</pubDate>
		<dc:creator>kenneo</dc:creator>
				<category><![CDATA[GPU]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Computer Architecture]]></category>
		<category><![CDATA[Computing]]></category>

		<guid isPermaLink="false">http://blog.langly.org/?p=479</guid>
		<description><![CDATA[Some new and interesting articles about the new Fermi architecture from nvidia:
http://www.realworldtech.com/page.cfm?ArticleID=RWT093009110932&#38;p=1
and
http://techreport.com/articles.x/17670
Quite interesting to see how they are turning back to a more G80ish architecture again.
]]></description>
			<content:encoded><![CDATA[<p>Some new and interesting articles about the new Fermi architecture from nvidia:</p>
<p><a title="Inside Fermi: Nvidia's HPC Push" href="http://www.realworldtech.com/page.cfm?ArticleID=RWT093009110932&amp;p=1" target="_blank">http://www.realworldtech.com/page.cfm?ArticleID=RWT093009110932&amp;p=1</a></p>
<p>and</p>
<p><a href="http://techreport.com/articles.x/17670">http://techreport.com/articles.x/17670</a></p>
<p>Quite interesting to see how they are turning back to a more G80ish architecture again.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.langly.org/2009/10/01/fermi-architecture/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
