Reverend said:
In reply to the original post -- this is usually a closely guarded internal secret and is considered proprietary information.
But the funny thing is, I bet it's like an 'open secret.' Embedded RAMs, whether they be SRAM, 1T-RAM, eDRAM, flash, present *unmistakeable* visual signatures on die-photographs. Even an untrained observer (like myself) could easily identify a RAM-block (larger than 1024 bytes) on a blow-up die photograph, thanks to their rectangular structure and regular (repetitive) structure. Once identified, it's a simple matter of taking a ruler or other straight-edge, tabulating size of all such arrays. From here, one could deduce a very good estimate of the RAM's transistor count based on known public info, like the process lithography size and RAM-technology (SRAM, 1T-SRAM, eDRAM.) So basically, companies who include die-photographs in their PR-kit, are tacitly giving away this 'secret.'
The core (digital-logic) datapath will NOT have the same kind of regular structure. And analog/mixed-signal circuits (PLLs, RAMDACs) have very large feature sizes that easily rule them out as anything except analog blocks.
Anyone has some facts about tranistor density of SRAM cells vs logic gates?
If NVidia's using the standard-cell Artisan library (given to TSMC customers free of charge), the Artisan library includes an SRAM memory-macro compiler. The macro-compiler auto-generates EDA/tool-views for layout and routing. All the info needed to answer your question is in the design kit, but unfortunately the customer must sign an NDA to acquire the design kit (even though it doesn't cost anything.) The memory-compiler for 0.18u (which I've used) can generate a variety of SRAM configs - single-port (R/W), two-port (R + R/W), and true dual-port (R/W + R/W.)
Asking about logic gate size is like asking about the length of an x86-instruction ... depends on the gate's function (OR, NOT, AND, etc.) and its drive-strength (X1, X2, X4, X8, etc.) And finally, standard-cell gates are laid out on a 'grid', so the gate's active area can be somewhat smaller than the grid's granularity. ATI has already said they use custom-design techniques in their digital-logic synthesis, meaning speed (or area) critical blocks are hand-designed (allowing the designer to defeat the grid restrictions.)