Most likely, it is the same kind of tiling that we know from P10, XP4, Glaze3d, all Radeon series chips and probably Geforce1-4 as well: it subdivides a polygon into tiles, and for each tile draws the portion of the polygon that falls within the tile, then moves on to next tile until it is finished with the polygon - and only then does it start drawing the next polygon. A tiled framebuffer cache.
TMDS, as used by DVI, has a bandwidth of a little less than 1 GByte/sec per channel (6 differential pairs; 1.65 Gbit/sec/pair; 10 bits per byte encoding), so interchip communication would probably be limited to one chip acting as a master or passthrough for the other chip in an AFR or SLI setup.