找回密码
 新注册用户
搜索
查看: 5032|回复: 10

Nvidia GT300内部代号“费米”

[复制链接]
发表于 2009-9-29 15:24:03 | 显示全部楼层 |阅读模式
就像AMD的DX11显卡还有Evergreen、Cypress、Junpier等一系列内部代号,NVIDIA GT300也有一个内部名字“Fermi”,来自著名物理学家恩里科·费米

恩里科·费米(Enrico Fermi),生于1901年9月29日,卒于1954年11月28日,美籍意大利裔物理学家,1938年诺贝尔物理学奖获得者,现代物理学的最后一位通才,量子力学和量子场论的创立者之一,首创了弱相互作用(β衰变)的费米理论,负责设计建造了世界首座自持续链式裂变核反应堆,还是曼哈顿计划的主要领导者。以他的名字命名的有费米黄金定则、费米-狄拉克统计、费米子、费米面、费米液体及费米常数等等。

NVIDIA以这位近代杰出物理学家称呼自己的新一代图形核心,看来是要决心引发一场核地震了。一个绿色、一个爆炸——这似乎也从侧面反映了AMD、NVIDIA在显卡策略取向上的不同。

GT300的具体架构和规格依然是个谜,只知道是G80以来的最大规模架构变革,非常注重并行计算,支持DirectX 11、OpenGL 3.1和GDDR5显存,NVIDIA已经内部展示过首批样卡,应该能赶在今年年底发布,双芯版本也已在规划之中。


传Nvidia已经在向几家主要的合作伙伴展示“Fermi”显卡,按Nvidia的计划,产品在内部架构部 分进行了重大改进,不仅绘图性能优秀,而且还具备强大的并行计算能力。


“Fermi”将支持GDDR5显存,无论在芯片性能或是芯片面积方面均将超过对手HD5870.另据称“Fermi”的GX2版本也已经在开发进程之中。为了使“Fermi”的并行计算性能有质的飞跃,Nvidia对“Fermi”的内部架构进行了自G80架构以来的最大架构改进。

预计“Fermi”的核心/显存频率参数与ATI的HD5870显卡将十分接近,不过其内含流处理器数量等参数则仍属未知。
http://www.fudzilla.com/content/view/15713/1/

[ 本帖最后由 zglloo 于 2009-9-29 15:30 编辑 ]
回复

使用道具 举报

发表于 2009-9-29 20:07:50 | 显示全部楼层
期待。。。
回复

使用道具 举报

发表于 2009-9-29 20:48:40 | 显示全部楼层
GT300真费米,我口袋里的米难免就要被这显卡费了……
回复

使用道具 举报

发表于 2009-9-30 00:22:11 | 显示全部楼层

回复 #3 cicikml 的帖子

樓上的gag有點爛啊...

自g80以來最大的架構改造
上次是ps/vs到sp
今次可能連sp都變掉了~ @@
回复

使用道具 举报

发表于 2009-9-30 20:55:26 | 显示全部楼层
GPU specifications
This is the meat part you always want to read fist. So, here it how it goes:
* 3.0 billion transistors
* 40nm TSMC
* 384-bit memory interface
* 512 shader cores [renamed into CUDA Cores]
* 32 CUDA cores per Shader Cluster
* 1MB L1 cache memory [divided into 16KB Cache - Shared Memory]
* 768KB L2 unified cache memory
* Up to 6GB GDDR5 memory
* Half Speed IEEE 754 Double Precision


完全为了 GPGPU 而生。。

不知道最多的6GB GDDR5要怎么贴在卡上。。
回复

使用道具 举报

发表于 2009-9-30 21:22:28 | 显示全部楼层

回复 #5 BiscuiT 的帖子

6GB GDDR5?你有没有说错?
回复

使用道具 举报

发表于 2009-9-30 21:26:35 | 显示全部楼层

回复 #6 ledled 的帖子

是最大支持 6GB 。。

科学计算领域。。内存少玩毛麽。。
nv这次奋不顾身要杀入 HPC 领域了。。
回复

使用道具 举报

发表于 2009-9-30 22:08:28 | 显示全部楼层
nVidia GT300's Fermi architecture unveiled: 512 cores, up to 6GB GDDR5 9/30/2009 by: Theo Valich - Get more from this author
Just like we disclosed in the first article "nVidia GT300 specifications revealed – it's a cGPU!", nVidia GT300 chip is a computational beast like you have never seen before. In fact, we would go as far out and state that this is as closest as GPU can be to a CPU in the whole history of graphics technology. Now, time will tell whatever GT300 was the much needed revolution.

Beside the regular NV70 and GT300 codenames [codename for the GPU], nVidia's insiders called the GPU architecture - Fermi. Enrico Fermi was an Italian physicist who is credited with the invention of nuclear reactor. That brings us to one of codenames we heard for one of the GT300 board itself - "reactor".
When it comes to boards themselves, you can expect to see configurations with 1.5, 3.0 GB and 6GB of GDDR5 memory, but more on that a little bit later.

GPU specifications
This is the meat part you always want to read fist. So, here it how it goes:

3.0 billion transistors
40nm TSMC
384-bit memory interface
512 shader cores [renamed into CUDA Cores]
32 CUDA cores per Shader Cluster
1MB L1 cache memory [divided into 16KB Cache - Shared Memory]
768KB L2 unified cache memory
Up to 6GB GDDR5 memory
Half Speed IEEE 754 Double Precision


As you can read for yourself, the GT300 packs three billion transistors of silicon real estate, packing 16 Streaming Multiprocessor [new name for former Shader Cluster] in a single chip. Each of these sixteen multiprocessors packs 32 cores and this part is very important - we already disclosed future plans in terms to this cluster in terms of future applications. What makes a single unit important is the fact that it can execute an integer or a floating point instruction per clock per thread.

TSMC was in charge of manufacturing the three billion transistor mammoth, but it didn't stop there. Just like the G80 chip, nVidia GT300 packs six 64-bit memory controllers for a grand total of 384-bit, bringing back the odd memory capacity numbers. The memory controller is a GDDR5 native controller, which means it can take advantage of built-in ECC features inside the GDDR5 SDRAM memory and more importantly, GT300 can drive GDDR5 memory in the same manner as AMD can with its really good Radeon HD 5800 series. The additional two memory interfaces will have to wait until 28nm or 22nm full node shrinks, if we get to them with an essentially unchanged architecture. You can expect that the lower-end variants of GT300 architecture will pack less dense memory controller for more cost efficiency, especially on the memory side.

GPGPU is dead, cGPU lives!
Just like we reported earlier, GT300 changed the way how the GPU is functioning. If we compare it to the old GT200 architecture, comparisons are breathtaking. Fermi architecture operates at 512 Fused Multiply-Add [FMA] operations per clock in single precision mode, or 256 FMA per clock if you're doing double precision.
The interesting bit is the type of IEEE formats. In the past, nVidia supported IEEE 754-1985 floating point arithmetic, but with GT300, nVidia now supports the latest IEEE 754-2008 floating-point standard. Just like expected, GT300 chips will do all industry standards - allegedly with no tricks.

A GPU supports C++ natively?
Ferni architecture natively supports C [CUDA], C++, DirectCompute, DirectX 11, Fortran, OpenCL, OpenGL 3.1 and OpenGL 3.2. Now, you've read that correctly - Ferni comes with a support for native execution of C++. For the first time in history, a GPU can run C++ code with no major issues or performance penalties and when you add Fortran or C to that, it is easy to see that GPGPU-wise, nVidia did a huge job.

To implement ISA inside the GPU took a lot of bravery, and with GT200 project over and done with, the time came right to launch a chip that would be as flexible as developers wanted, yet affordable.

In a nutshell, this is just baseline information about what nVidia is going to introduce in the next couple of weeks. Without any doubt, we can see that nVidia reacted to Larrabee by introducing a part that is extremely efficient, natively support key industry standards and more importantly, doesn't cost an arm and a leg.

The line-up is consisted out of high-end consumer part [GeForce], commercial [Quadro] and scientific [Tesla]. You can expect memory sizes from 1.5GB for consumer GeForce 380 to 6GB for commercial Quadro and Tesla parts.
回复

使用道具 举报

发表于 2009-9-30 23:05:49 | 显示全部楼层
Half Speed IEEE 754 Double Precision
这已经很不容易了,GT200是1/8。

还有renamed into CUDA Cores,NV已经开始向普通玩家灌输通用计算的概念了。

[ 本帖最后由 cicikml 于 2009-9-30 23:07 编辑 ]
回复

使用道具 举报

发表于 2009-9-30 23:17:40 | 显示全部楼层

回复 #9 cicikml 的帖子

支持原生执行c++ 这才是最厉害。。

话说这根本不是面对普通玩家的东西。。nv已经奔去HPC方向了。。
回复

使用道具 举报

发表于 2009-9-30 23:32:59 | 显示全部楼层

回复 #10 BiscuiT 的帖子

這功能牛呀.....
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 新注册用户

本版积分规则

论坛官方淘宝店开业啦~
欢迎大家多多支持基金会~

Archiver|手机版|小黑屋|中国分布式计算总站 ( 沪ICP备05042587号 )

GMT+8, 2024-6-3 17:33

Powered by Discuz! X3.5

© 2001-2024 Discuz! Team.

快速回复 返回顶部 返回列表