The 10 core Westmere-EX (Xeon E7-!800) MP server processor [10]
3. Heterogeneous multicores (1)
3.2 Add-on type multicores (1)
3.2.1.4 Example 2: Intel’s on-die integrated CPU/GPUs (8)
© Sima Dezső, ÓE NIK 148 www.tankonyvtar.hu
4. Outlook
4. Outlook (1)
4. Outlook
Heterogenous multicores
1(Ma):M(S) 2(Ma):M(S) M(Ma):M(S) 1(CPU):1(D) M(CPU):1(D) M(CPU):M(D)
Ma: Master S: Slave M: Many
D: Dedicated (like GPU) H: Homogenous
M: Many
M(Ma) = M(CPU) M(S) M(D)
Figure 4.1: Expected evolution of heterogeneous multicores
Master/slave
type multicores Add-on
type multicores
=
© Sima Dezső, ÓE NIK 150 www.tankonyvtar.hu
Master-slave type multicores require much more intricate workflow control and synchronization than add-on type multicores
It can expected that add-on type multicores will dominate the future of heterogeneous multicores.
4. Outlook (2)
5. References
© Sima Dezső, ÓE NIK 152 www.tankonyvtar.hu
References (1)
[1]: Gilbert J. D., Hunt S. H., Gunadi D., Srinivas G., The Tulsa Processor: A Dual Core Large Shared-Cache Intel Xeon Processor 7000 Sequence for the MP Server Market Segment,
Aug 21 2006, http://www.hotchips.org/archives/hc18/3_Tues/HC18.S9/HC18.S9T1.pdf [2]: Intel Server Board Set SE8500HW4, Technical Product Specification, Revision 1.0, May 2005, ftp://download.intel.com/support/motherboards/server/sb/se8500hw4_board_
set_tpsr10.pdf
[3]: Intel® E8501 Chipset North Bridge (NB) Datasheet, Mai 2006, http://www.intel.com/design/chipsets/e8501/datashts/309620.htm
[4]: Supermicro Motherboards, http://www.supermicro.com/products/motherboard/
[5]: Next-Generation AMD Opteron Processor with Direct Connect Architecture – 4P Server Comparison, http://www.amd.com/us-en/assets/content_type/DownloadableAssets/4P_
Server_Comparison_PID_41461.pdf
[6]: Supermicro P4QH6 / P4QH8 User’s Manual, 2002,
http://www.supermicro.com/manuals/motherboard/GC-HE/MNL-0665.pdf
[7]: Intel® 7300 Chipset Memory Controller Hub (MCH) – Datasheet, Sept. 2007, http://www.intel.com/design/chipsets/datashts/313082.htm
[8]: Quad-Core Intel® Xeon® Processor 7300 Series Product Brief, Intel, Nov. 2007 http://download.intel.com/products/processor/xeon/7300_prodbrief.pdf
[9]: Mitchell D., Intel Nehalem-EX review, PCPro,
http://www.pcpro.co.uk/reviews/processors/357709/intel-nehalem-ex
[10]: Nagaraj D., Kottapalli S.: Westmere-EX: A 20 thread server CPU, Hot Chips 2010
http://www.hotchips.org/uploads/archive22/HC22.24.610-Nagara-Intel-6-Westmere-EX.pdf
References (2)
[11]: Intel Xeon Processor E7-8800/4800/2800 Product Families, Datasheet Vol. 1 of 2, April 2011, http://www.intel.com/Assets/PDF/datasheet/325119.pdf
[12]: Intel Xeon Processor 7500/6500 Series, Public Gold Presentation, March 30 2010, http://cache-www.intel.com/cd/00/00/44/64/446456_446456.pdf
[13]: Supermicro X8QB6-F / X8QBE-F User’s Manual, 2010,
http://files.siliconmechanics.com/Documentation/Rackform/iServ/R413/Mainboard/MNL -X8QB-E-6-F.pdf
[14]: Mattson T., The Future of Many Core Computing: A tale of two processors, March 4 2010, http://og-hpc.com/Rice2010/Slides/Mattson-OG-HPC-2010-Intel.pdf
[15]: Kirsch N., An Overview of Intel's Teraflops Research Chip, Febr. 13 2007, Legit Reviews, http://www.legitreviews.com/article/460/1/
[16]: Rattner J., „Single-chip Cloud Computer”, Dec. 2 2009
http://www.pcper.com/reviews/Processors/Intel-Shows-48-core-x86-Processor-Single- chip-Cloud-Computer
[17]: Nvidia CUDA C Programming Guide, Version 3.2, October 22 2010
http://developer.download.nvidia.com/compute/cuda/3_2/toolkit/docs/CUDA_C_
Programming_Guide.pdf
[18]: Chu M. M., GPU Computing: Past, Present and Future with ATI Stream Technology, AMD, March 9 2010,
http://developer.amd.com/gpu_assets/GPU%20Computing%20-%20Past%20 Present%20and%20Future%20with%20ATI%20Stream%20Technology.pdf
© Sima Dezső, ÓE NIK 154 www.tankonyvtar.hu
References (3)
[19]: Hwu W., Kirk D., Nvidia, Advanced Algorithmic Techniques for GPUs, Berkeley, January 24-25 2011
http://iccs.lbl.gov/assets/docs/20110124/lecture1_computational_thinking_Berkeley_2011.pdf [20]: Shrout R., Nvidia GT200 Revealed – GeForce GTX 280 and GTX 260 Review,
PC Perspective, June 16 2008,
http://www.pcper.com/article.php?aid=577&type=expert&pid=3
[21]: Nvidia CUDA Compute Unified Device Architecture Programming Guide, Version 2.0, June 2008, Nvidia,
http://developer.download.nvidia.com/compute/cuda/2_0/docs/NVIDIA_CUDA_Programming Guide_2.0.pdf
[22]: Next Gen CUDA GPU Architecture, Code-Named “Fermi”, Press Presentation at Nvidia’s 2009 GPU Technology Conference, (GTC), Sept. 30 2009,
http://www.nvidia.com/object/gpu_tech_conf_press_room.html
[23]: Nvidia’s Next Generation CUDATM Compute Architecture: FermiTM, Version 1.1, 2009 http://www.nvidia.com/content/PDF/fermi_white_papers/NVIDIA_Fermi_Compute_
Architecture_Whitepaper.pdf
[24]: Kanter D., Inside Fermi: Nvidia's HPC Push, Real World Technologies Sept 30 2009,
http://www.realworldtech.com/includes/templates/articles.cfm?ArticleID=RWT093009110932&
mode=print
[25]: Kirsch N., NVIDIA GF100 Fermi Architecture and Performance Preview, Legit Reviews, Jan 20 2010, http://www.legitreviews.com/article/1193/2/
References (4)
[26]: Wasson S., Inside Fermi: Nvidia's 'Fermi' GPU architecture revealed, Tech Report, Sept 30 2009, http://techreport.com/articles.x/17670/1
[27]: Glaskowsky P. N., Nvidia’s Fermi: The First Complete GPU Computing Architecture Sept 2009, http://www.nvidia.com/content/PDF/fermi_white_papers/
P.Glaskowsky_NVIDIA's_Fermi-The_First_Complete_GPU_Architecture.pdf
[28]: Kanter D., “NVIDIA’s GT200: Inside a Parallel Processor,” Real World Technologies, Sept. 8 2008, http://www.realworldtech.com/page.cfm?ArticleID=RWT090808195242 [29]: Hoenig M., Nvidia GeForce 580 Review, HardwareCanucks, Nov. 8, 2010,
http://www.hardwarecanucks.com/forum/hardware-canucks-reviews/37789-nvidia- geforce-gtx-580-review-5.html
[30]: Wasson S., Nvidia's GeForce GTX 480 and 470 graphics processors, Tech Report, March 31 2010, http://techreport.com/articles.x/18682
[31]: Piazza T., Dr. Jiang H., Microarchitecture Codename Sandy Bridge: Processor Graphics, Presentation ARCS002, IDF San Francisco, Sept. 2010
[32]: Kahn O., Valentine B., Microarchitecture Codename Sandy Bridge: New Processor Innovations, Presentation ARCS001, IDF San Francisco Sept. 2010
[33]: Hagedoorn H. Mohammad S., Barling I. R., Core i5 2500K and Core i7 2600K review, Jan. 3 2011,
http://www.guru3d.com/article/core-i5-2500k-and-core-i7-2600k-review/2 [34]: Intel Sandy Bridge Review, Bit-tech, Jan. 3 2011,
http://www.bit-tech.net/hardware/cpus/2011/01/03/intel-sandy-bridge-review/1
© Sima Dezső, ÓE NIK 156 www.tankonyvtar.hu
References (5)
[35]: Wikipedia: Intel GMA, 2011, http://en.wikipedia.org/wiki/Intel_GMA
[36]: Shimpi A. L., The Sandy Bridge Review: Intel Core i7-2600K, i5-2500K and Core i3-2100 Tested, AnandTech, Jan. 3 2011,
http://www.anandtech.com/show/4083/the-sandy-bridge-review-intel-core-i7-600k-i5-2500k-core-i3-2100-tested/11
[37]: Wall D. W.: Limits of Instruction Level Parallelism, WRL TN-15, Dec. 1990 [38]: Bhandarkar D.: „The Dawn of a New Era”, Presentation EMEA, May 11 2006.
[39]: Moore G. E., No Exponential is Forever… ISSCC 2003,
http://download.intel.com/research/silicon/Gordon_Moore_ISSCC_021003.pdf [40]: Intel Roadmap 2006, Source Intel
[41]: Davis E.: Tera Tera Tera Presentation, 2008
http://bt.pa.msu.edu/TM/BocaRaton2006/talks/davis.pdf
[42]: Stokes J.: Clearing up the confusion over Intel’s Larrabee, Part II
http://arstechnica.com/hardware/news/2007/06/clearing-up-the-confusion-over-intels- larrabee-part-ii.ars
[43]: Taylor M. B. & all: Evaluation of the Raw Microprocessor, Proc. ISCA 2004 http://groups.csail.mit.edu/cag/raw/documents/raw_isca_2004.pdf
[44]: Wright C., Henning P.: Roadrunner Tutorial, An Introduction to Roadrunner and the Cell Processor, Febr. 7 2008,
http://ebookbrowse.com/roadrunner-tutorial-session-1-web1-pdf-d34334105
References (6)
[45]: Seguin S.: IBM Roadrunner Beats Cray’s Jaguar, Tom’s Hardware, Nov. 18 2008 http://www.tomshardware.com/news/IBM-Roadrunner-Top500-Supercomputer,6610.html
© Sima Dezső, ÓE NIK 158 www.tankonyvtar.hu