If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 |
|
Regular
Join Date: Jan 2008
Posts: 354
|
For those of you wondering about the changes in the IVB graphics architecture, I have a deep dive that compares IVB to SNB and discusses the details of how the GPU was improved:
http://www.realworldtech.com/page.cf...WT042212225031 Thanks to Willard for posting on the front page! DK
__________________
www.realworldtech.com |
|
|
|
|
|
#2 | |
|
Member
Join Date: Mar 2004
Posts: 751
|
I find this quote interesting from the AnandTech review of the Intel Ivy Bridge.
Quote:
__________________
Never Argue With An Idiot. They'll Lower You To Their Level And Then Beat You With Experience! |
|
|
|
|
|
|
#3 |
|
Darlek ******
Join Date: Jun 2004
Posts: 9,489
|
"Unfortunately at the time only Apple was interested in a hypothetical Ivy Bridge GT3 and rumor has it that Otellini wasn't willing to make a part that only one OEM would buy in large quantities."
surely if its faster than what amd offer it will sell ? apple is unique in oem's as it builds not just systems but operating systems and applications so they would care about capabilities, other oem's just care about price and is it desireable to end users (ie: bang for buck)
__________________
Guardian of the Most holy Two Terabytes of Gaming Goodness™ |
|
|
|
|
|
#4 |
|
Regular
Join Date: Jan 2008
Posts: 354
|
A GT3 part would require a lot more effort. I also wonder at what point do you start to get memory bandwidth limited...
David
__________________
www.realworldtech.com |
|
|
|
|
|
#5 |
|
Member
Join Date: Jun 2004
Posts: 168
|
Even if Anandtech predict Haswell GT3 will offer 3x the performance of Ivy it is still no where near what discreet would be able to do. And since Discrete Graphics having gone through 2 cycle of design based on power efficiency, they can now idle at very low power.
So we got back to the questions, why should we ( end user ) want or need Integrated Graphics? |
|
|
|
|
|
#6 |
|
Darlek ******
Join Date: Jun 2004
Posts: 9,489
|
price
__________________
Guardian of the Most holy Two Terabytes of Gaming Goodness™ |
|
|
|
|
|
#7 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
Discrete cards are the additional component to the system, while the IGP is there by default.
The add-in board is what needs to justify itself. For those who want the performance, upgradability, and a broader and somewhat fresher set of secondary features, the cards are justifiable. For the vast majority of systems, the argument for moving beyond an IGP weakens the less the user demands of the system. Ivy Bridge's slow evolution for the IGP means that the read/write paths are separate, something AMD has only just moved past for GCN. At some point, it seems Intel would move past this. Trinity and either this or Intel's next IGP may be the last examples of the split memory pipeline. With the programmability aspects being fleshed out, the thing that seems more important is how effectively stacked DRAM or interposer connections can begin to eat into the discrete board's memory bandwidth advantage, and how quickly the various competitors can get to that point.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#8 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
What do you mean by separate read/write paths?
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#9 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
The memory pipeline has read-only paths for the L1, L2, and L3 caches.
GCN now has a read/write capability down the same path.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#10 | |
|
Senior Member
Join Date: Oct 2002
Posts: 2,434
|
Quote:
|
|
|
|
|
|
|
#11 | |
|
Senior Member
|
Quote:
Discretes can use interposers too. Probably more expensive interposers since they are now limited to higher price points. |
|
|
|
|
|
|
#12 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
The CPU memory bus has additional constraints that weigh it down, thanks to the multidrop bus and the number of discontinuities from the CPU to socket to motherboard to slot to DIMM.
A discrete board could still hold the advantage, but it may not be the near order of magnitude between a desktop processor and an enthusiast video card. The larger power envelope remains an advantage, for now. However, I remember when Intel introduced the BTX spec, to betther handle heat from the CPU socket. The primary need evaporated when more efficient chips than Prescott came about. The funny thing is that back in the days when BTX was mocked for trying to cater to an overheated CPU burning north of 100 watts, GPUs weren't 300 Watt monstrosities with blowers. Ivy Bridge didn't signficantly change the level of integration between the CPU and GPU portions, but it was hinted that the next round will be different. Once memory spaces become shared with an Intel design or AMD's avowed goal with its heterogenous compute model, the discrete board's real weak point as a slave device spanning a high latency bus will become more difficult to hide. Once the GPU is on a interposer, why keep it on the far side of an expansion bus, or out of a socket? Was something like the BTX thermal module so bad now that we have graphics boards taking up two or three expansion slots?
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#13 |
|
Regular
Join Date: Jan 2008
Posts: 354
|
Today, the fundamental advantage for discrete GPUs is a larger power budget and dedicated memory. Looking out 5 years, I think only the larger power budget will remain.
Sure, Intel and AMD might not throw down 400mm2 on an IGP...and dedicated GPUs will probably have more memory bandwidth, but those are largely cost driven constraints. It's a matter of wanting to expand into higher cost markets, and that desire probably isn't there. DK
__________________
www.realworldtech.com |
|
|
|
|
|
#14 | |
|
Senior Member
|
Quote:
I think discretes will have a bw advantage since they don't have to share it with a CPU and since they tend to employ larger die sizes, I am guessinig they will be in a position to afford wider mem buses, even on an interposer. And who knows, may be adding a small CPU core or two (bobcat ish) on a discrete might not be such a bad idea after all. |
|
|
|
|
|
|
#15 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
For what data types? UAVs?
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#16 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,071
|
Untyped read/write/atomic with MUBUF, image read/write/atomic with MIMG.
MTBUF has read/write for typed buffers, with the type being dictated by a resource constant. The AMD presentation doesn't list out atomics for this one, and that does sound like it could be used for a UAV with its lack of ordering. Nvidia's graphics export pipe is the most integrated with the cache hierarchy, since the ROPs use the L2. AMD seems to be less so since GDS and graphics export have a side path and the ROPs are separate. IVB looks at a higher level to resemble an earlier AMD GPU, possibly before the introduction of that little UAV cache the preceded the R/W cache hierarchy. The ROP path seems specialized enough to keep a separation for all three. Nvidia's done the most to update the graphics domain, hence why it seems the ROP path is the most tightly integrated. AMD's compute side has been overhauled, but it seems like its current design has compromised on a a CU array that prioritizes each CU being able to serve different compute clients. The modestly evolved graphics domain exists at a slight remove, with the specialized export bus between the freer compute array and the ordered ROP and GDS hardware. Perhaps Intel hasn't opted for closing the loop yet because of the cost involved in making the leap, and because it's really not hurting as badly for compute performance thanks to its CPU dominance. They may do it because the shrinking volume of the discrete market may make it too expensive to have a GPU-only chip. There may be a range of APUs, with some having a very high balance of GPU capability. Perhaps a gamer system with dual sockets, one heavy on the CPU, the other on GPU?
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#17 | |
|
Member
Join Date: Sep 2010
Posts: 1,002
|
Quote:
It's not price but I think it's fusion the right answer. In future when you won't be able to recognise what the classic CPU and classic graphics part of the chip are, then this will be very helpful and accelarating all kinds of compute. |
|
|
|
|
|
|
#18 |
|
Specious Misanthrope
Join Date: May 2003
Location: Treading Water
Posts: 7,459
|
|
|
|
|
|
|
#19 |
|
Member
Join Date: Sep 2010
Posts: 1,002
|
That's a question of human psychology, everyone has his/ her own criteria for satisfaction.
If you ask me, personally, then my own 6870 is the absolute minimum for satisfaction. The more the better you know. And more people understand it when they have to deal with the awful performance of those integrated solutions (even when browsing and scrolling down some web pages there is a possibility you feel how weak actually they are) - they simply won't have the freedom to launch everything they want... The price is not that much of a problem I think. I mean in the sane zone of prices- like 50-100-150 $. |
|
|
|
|
|
#20 |
|
Senior Member
|
Not to mention power, physical size, easier heterogeneous computing.
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
#21 |
|
Senior Member
|
Results from several synthetic tests under OCL:
![]() ![]() ![]()
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#22 |
|
Entirely Suboptimal
Join Date: Mar 2003
Location: WI, USA
Posts: 6,845
|
For desktop use? That's craziness. These modern CPUGPUs are as capable as some not-so-low-end discrete cards of recent years. Hell, I can be happy using Aero on GMA 950 for most desktop stuff.
|
|
|
|
|
|
#23 |
|
Red-headed step child
Join Date: Jun 2004
Location: Guess ;)
Posts: 3,084
|
The GMA that comes with my wife's ancient Atom N270 netbook is absolutely sufficient to drive Win7 Aero Glass on my Dell U2711 at 2048x1152 (maximum analog VGA output rez.) She tools around on that box all day without complaint.
Sure, the Atom processor itself is pretty gutless, but she uses the entire Office 2010 suite, Quicken, and Internet Exploder v9 without fuss. She knows that my personal laptop is 'faster', but she also doesn't know how to engage the ATI card so it just runs on the Intel HD graphics built into the Allendale i5. She only notices the speed increase when working with the RAW files from our DSLR though, which actually isn't graphics-subsystem intensive, it's all CPU work. Both of my parents, both of my step-parents, both of my brothers-in-law, and my parents-in-law all use Intel integrated graphics on their various laptops and desktops without issue or complaint. I would know, because they call me when it IS slow
__________________
"...twisting my words" |
|
|
|
|
|
#24 |
|
Invisible Member
Join Date: Apr 2002
Location: La-la land
Posts: 4,985
|
Never tried any intel integrated graphics before prior to sandy bridge, but that level is quite sufficient for general OS stuff, even at 2560*1440. Ivy bridge, being considerably faster, would just be bonus cake on top...
Haswell, if it is as speedy as rumored, would even be decent gaming material, especially if coupled with some fast, say ~2.3GHz, DDR3. I'm really curious to see what Intel will cook up for future CPUs, they really seem to be on a roll. Considering CPU performance has been plateauing for a good while now, it stands to reason that most of the extra transistors that will become available by smaller processes - and hence also an increasing part of the power budget - will go to the graphics co-processor in future CPUs, and that is quite exciting IMO.
__________________
"If I were a science teacher and a student said the Universe is 6000 years old, I would mark that answer as wrong (why? Because it is)." -Phil Plait |
|
|
|
|
|
#25 |
|
Senior Member
Join Date: Feb 2004
Posts: 2,440
|
My old 8500 GT was entirely insufficient for the Windows Vista version of Aero. I turned off Aero and things sped way, way up.
|
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|