Pankaj Mehta |
Data Partitioning SchemesTeradata uses a single partitioning scheme, a hash partition. Data are processed through an algorithm and automatically distributed across all Virtual AMPs. While this limits the DBA’s flexibility, it has proven to be an extremely effective algorithm and also eliminates data skewing. |
![]() |
| Evenly distributing up to 100 terabytes of data is the key to evenly distributing the application workload. In addition, since data placement is done without Database Administrator intervention or planning, a Teradata system has extremely low support costs. |
| How
Rows are Distributed Across the Disks
In a Teradata database, the rows of every table are distributed randomly and evenly across all of the VPROCs (the units of parallelism) in the system. The DBA is never given a choice to only populate some selected VPROCs or nodes. This even and automatic distribution ensures equal processing effort, as well as data balance across the entire system, no matter how large it grows or what type of query activity it faces. Achieving this balance will depend on the table’s primary index columns being unique or nearly unique, as discussed later. |
| Traditional File systems:
Traditional databases uses:
There is never a need for DBA-intensive activities such as database
reorgs with Teradata. |