Sega Dreamcast/Hardware comparison
From Sega Retro
This article needs cleanup. This article needs to be edited to conform to a higher standard of article quality. After the article has been cleaned up, you may remove this message. For help, see the How to Edit a Page article. |
Contents
Vs. PC
The Sega Dreamcast's PowerVR CLX2 GPU was the basis for the PowerVR PMX1, a PC GPU released with the Neon 250 graphics card in 1999. However, the Neon 250 lacks many of the tiled rendering features of the CLX2: the tile size is halved from 32×32 pixels to 32×16 pixels (halving the fillrate), it lacks the CLX2's internal Z-buffering and alpha test capability with hardware front-to-back translucency sorting (further reducing the fillrate and performance, as well as requiring the Neon 250 to render a Z-buffer externally), and the tiling is partially handled by software (the CLX2 handles the tiling entirely in hardware). The Neon 250 also lacks the CLX2's latency buffering and palettized texture support while VQ texture compression performance is halved, and it has bus contention due to having a single data bus (whereas the CLX2 has two data buses). The PowerVR2 was also optimized for the Hitachi SH-4's geometry processing capabilities (rather than for a Pentium II or III), while PC drivers and software were not optimized for the Neon 250's tiled rendering architecture (compared to Dreamcast games which were optimized for the CLX2's tiled rendering architecture). The Neon 250 thus had only a fraction of the Dreamcast CLX2's fillrate and rendering performance. The reduction in performance from the Dreamacst's CLX2 to the Neon 250 was comparable to the reduction in performance from the Sega Model 3's Real3D Pro-1000 to the Intel740.
The Dreamcast was generally the most powerful home system during 1998–1999, outperforming high-end PC hardware at the time.[1] The Dreamcast's Hitachi SH-4 CPU calculates 3D graphics four times faster than a Pentium II from 1998,[1] and faster than a Pentium III and NVIDIA GeForce 256 from 1999. The Dreamcast's PowerVR CLX2 GPU, due to its tiled rendering architecture, also has has a higher fillrate than a Voodoo3 and GeForce 256 from 1999.
The Dreamcast's CPU–GPU transmission bus is faster than the Voodoo3 and has a higher effective bandwidth than the GeForce 256 due to the Dreamcast's efficient bandwidth usage, including its lack of CPU overhead from the operating system and the CLX2's tiled rendering architecture: textures loaded directly to VRAM (freeing up CPU–GPU transmission bus for polygons), higher texture compression, on-chip tile buffer with internal Z-buffering, and deferred rendering (no need to draw, shade or texture overdrawn polygons). The CLX2 was also capable of order-independent transparency (which the Voodoo3 and GeForce 256 lacked) and Dot3 normal mapping (which the Voodoo3 lacked).[2] The CLX2's rendering throughput was 7 million polygons/s, with game engine performance peaking at 5 million polygons/s,[3] faster than the Voodoo3, and also faster than the GeForce 256, which has a peak rendering throughput of 6.7 million polygons/s[4] and peak game engine performance of 2.9 million polygons/s.[5]
Dreamcast game engines rendered 50,000–166,666 polys per scene (3–5 million polygons/s),[3] while PC game engines of 1999 rendered up to 10,000 polys per scene[6][7] (1–1.6 million polygons/s).[8] Character models in particular were significantly more detailed in Dreamcast games than in PC games during 1998–1999.[9]
Vs. PlayStation 2
Compared to the rival PlayStation 2, the Dreamcast is better at textures, anti-aliasing, and image quality, while the PS2 is better at polygon geometry, particles, and lighting. The PS2 has a more powerful CPU geometry engine (6.2 GFLOPS Emotion Engine), higher translucent fillrate (2.4 GPixels/s), and more main RAM (32 MB, compared to DC's 16 MB), while the DC has more VRAM (8 MB, compared to PS2's 4 MB), higher opaque fillrate (3.2 GPixels/s), and more GPU hardware features, with CLX2 capabilities like tiled rendering, super-sample anti-aliasing, Dot3 normal mapping, order-independent transparency, and texture compression, which the PS2's Graphics Synthesizer GPU lacks.
With larger VRAM and tiled rendering, the DC can render a larger framebuffer at higher native resolution (with an on-chip Z-buffer), and with texture compression, it can compress around 20–60 MB of texture data in its VRAM. Because the PS2 has only 4 MB VRAM, it relies on the main RAM to store textures, but the PS2's CPU–GPU transmission bus for transferring polygons and textures has a bandwidth of 1.2 GB/s; while 50% faster than the DC's 800 MB/s CPU–GPU transmission bus, the DC has textures loaded directly to VRAM (freeing up the CPU–GPU transmission bus for polygons) and texture compression gives it around 2–6 GB/s of effective texture bandwidth. DC games were effectively using 20–30 MB of texture data[10] (compressed to around 5–6 MB),[11] while PS2 games up until 2003 peaked at 5.5 MB of texture data (average 1.5 MB).
PS2 games up until 2003 rendered up to 7.5 MPolys/s (145,000 polys per scene), with most rendering 2–5 MPolys/s (average 52,000 polys per scene);[12] in comparison, DC game engines rendered up to 5 MPolys/s (166,666 polys per scene), with most games rendering 2–4 MPolys/s (average 50,000 polys per scene).[3]
The DC is more user-friendly for developers, making it easier to develop for, while the PS2 is more difficult to develop for; this is the reverse of the 32-bit era, when the PlayStation was more user-friendly, and the Saturn more difficult, for developers.
Vs. GameCube and Xbox
The Xbox and GameCube were both more powerful than the Dreamcast, but the Dreamcast had several hardware advantages. The Dreamcast has a higher opaque fillrate (3.2 GPixels/s) than the GameCube and Xbox (both under 1 GPixels/s). The Dreamcast's translucent fillrate (500 MPixels/s) was comparable to the Xbox's practical fillrate (250-700 MPixels/s), but lower than the GameCube's fillrate (648-800 MPixels/s).[13] The Dreamcast's SH-4 CPU has a faster floating-point performance (1.4 GFLOPS) than the Xbox's PIII-based CPU (733 MFLOPS), but lower than the GameCube CPU's floating-point performance (1.9 GFLOPS). However, the GameCube and Xbox have T&L GPU with floating-point capabilities, giving both faster floating-point performance than the Dreamcast.
Graphics comparison
- See Sega Dreamcast technical specifications for more technical details on Dreamcast hardware
System | Sega Dreamcast (1998) | PC (1998) | PC (1999) | ||
---|---|---|---|---|---|
Geometry processors | Hitachi SH-4 (200 MHz) |
Intel Pentium II (450 MHz) |
Intel Pentium III 800EB (800 MHz) |
NVIDIA GeForce 256 (120 MHz) | |
Floating-point operations | 1400 MFLOPS[14] | 350 MFLOPS[n 1] | 800 MFLOPS[15][16] | 530 MFLOPS[4] | |
T&L calculations | 10 million polygons/s[14] | 2.5 million polygons/s[n 2] | 6.7 million polygons/s[4] | 4.4 million polygons/s[4] | |
Rendering processors | NEC-VideoLogic PowerVR CLX2 (100 MHz) |
3dfx Voodoo Banshee (100 MHz) |
3dfx Voodoo3 3500 TV SE (200 MHz) |
NVIDIA GeForce 256 (120 MHz) | |
Tiled rendering calculations | 200 MFLOPS | N/A | N/A | N/A | |
Rendering fillrate |
Opaque polygons | 3200 megapixels/s[14] | 100 megapixels/s | 200 megapixels/s | 480 megapixels/s |
Opaque/Translucent polygons |
500 megapixels/s[17] | ||||
Texture capabilities |
Texture fillrate | 500 megatexels/s | 100 megatexels/s | 400 megatexels/s | 480 megatexels/s |
Texture compression | 8:1 (VQ) | 1:1 (N/A) | 4:1 (FXT1) | 6:1 (S3TC) | |
CPU–GPU transmission bus |
Bandwidth | 800 MB/s[14] | 533 MB/s[n 3] | 533 MB/s[n 4] | 1 GB/s[n 5] |
Effective texture bandwidth |
6.4 GB/s | 533 MB/s | 2 GB/s | 6 GB/s | |
Polygon rendering throughput | 7 million polygons/s[14] | 750,000 polygons/s[n 6] | 1.8 million polygons/s[n 7] | 6.7 million polygons/s[4] |
Notes
- ↑ [Dreamcast CPU four times faster than Pentium II[1] Dreamcast CPU four times faster than Pentium II[1]]
- ↑ [Dreamcast CPU four times faster than Pentium II[1] Dreamcast CPU four times faster than Pentium II[1]]
- ↑ AGP bus (2x AGP 2.0)[18]
- ↑ [AGP bus (2x AGP 2.0) AGP bus (2x AGP 2.0)]
- ↑ [Transmission bus from Pentium III 800EB (133 MHz FSB, 1 GB/s) to GeForce 256 (4x AGP 2.0)[18] Transmission bus from Pentium III 800EB (133 MHz FSB, 1 GB/s) to GeForce 256 (4x AGP 2.0)[18]]
- ↑ [The Celeron 300A 450 MHz[19] (100 MHz FSB,[8] 364 MFLOPS)[20] with Voodoo3 TV (183 MHz) renders 750,000 polygons/s[21] The Celeron 300A 450 MHz[19] (100 MHz FSB,[8] 364 MFLOPS)[20] with Voodoo3 TV (183 MHz) renders 750,000 polygons/s[21]]
- ↑ [The Celeron 300A 450 MHz (100 MHz FSB, 364 MFLOPS) with Voodoo3 TV (183 MHz) renders 750,000 polygons/s The Celeron 300A 450 MHz (100 MHz FSB, 364 MFLOPS) with Voodoo3 TV (183 MHz) renders 750,000 polygons/s]
References
- ↑ 1.0 1.1 1.2 1.3 File:GamersRepublic US 03.pdf, page 29
- ↑ [PC Magazine, December 1999, page 193 PC Magazine, December 1999, page 193]
- ↑ 3.0 3.1 3.2 Test Drive: Le Mans (IGN)
- ↑ 4.0 4.1 4.2 4.3 4.4 Benchmarking T&L in 3DMark 2000
- ↑ Actual HW T&L perfomance of NVIDIA GeForce/GeForce2 chips (IXBT Labs)
- ↑ [PC Magazine, December 1999, page 203 PC Magazine, December 1999, page 203]
- ↑ Unreal Modeling Guide (Unreal Developer Network)
- ↑ 8.0 8.1 '95-'99 PC Comparisons
- ↑ DF Retro: Shenmue - A Game Ahead Of Its Time (Digital Foundry)
- ↑ Hideki Sato Sega Interview (Edge)
- ↑ How Many Polygons Can the Dreamcast Render?
- ↑ Reaching for the Limits of PS2 Performance: How Far Have We Got? (2003) (SCEE) (Wayback Machine: 2003-12-10 07:46)
- ↑ Graphics Processor Specifications (IGN) (Wayback Machine: 2001-03-31 05:05)
- ↑ 14.0 14.1 14.2 14.3 14.4 Sega Dreamcast: Implementation (IEEE) (Wayback Machine: 2000-08-23 20:47)
- ↑ Automatic Performance Tuning of Sparse Matrix Kernels, Volume 1, page 14
- ↑ Cluster Computing, page 9
- ↑ File:Edge UK 067.pdf, page 11
- ↑ 18.0 18.1 AGP Peak Speeds
- ↑ 20th anniversary Pentium specs leak – will this be the modern era’s Celeron 300A? (ExtremeTech)
- ↑ Recent Advances in Parallel Virtual Machine and Message Passing Interface, page 301
- ↑ 3DMARK 2001SE Benchmarks