From: iajzenszmi on
Jacket Overview
Note: GPU = Graphical Processor Unit.
Jacket is a GPU engine for MATLAB®. Jacket enables standard MATLAB
code to run on the GPU, connecting the user-friendliness of MATLAB
directly to the speed and visual computing capability of the GPU.

Jacket is not another GPU API, nor is it simply a collection of GPU
MEX functions. Rather, it is a complete and transparent system,
automatically making memory transfer and execution optimization
decisions. Jacket uses a compile on-the-fly system to allow GPU
functions to run in MATLAB's interpretive style. Currently, Jacket is
built on NVIDIA's CUDA technology.

Jacket also includes the Graphics Toolbox for MATLAB (now available
for ALL operating systems). The Graphics Toolbox integrates the Jacket
computational engine with the full OpenGL capabilities of your GPU.
This coupling of computation and graphics allows you to develop true
Visual Computing applications.

The world's first teraflop many-core processor
NVIDIA® Tesla™ computing solutions enable the necessary transition to
energy efficient parallel computing power. With 240 cores per
processor and based on the revoluationary NVIDIA® CUDA™ parallel
computing architecture, Tesla scales to solve the world's most
important computing challenges—more quickly and accurately.

Tesla C870

The NVIDIA® Tesla™ C870 GPU computing processor is a massively multi-
threaded processor architecture that is ideal for high performance
computing (HPC) applications used by scientists, engineers, and other
technical professionals.

The Tesla C870 GPU computing processor transforms a standard
workstation into a personal supercomputer. With 128 streaming
processor cores, the CUDA C-language development environment and
developer tools, and a range of applications already ported, the Tesla
C870 enables professionals to develop applications faster and solve
problems that traditionally required access to a shared server

Massively Multi-threaded Processor Architecture
Solve compute problems at your workstation that previously required a
large cluster

128 Floating Point Processor Cores
Achieve up to 350 GFLOPS of performance (512 GFLOPS peak) with one
C870 GPU

Multi-GPU Computing
Solve large-scale problems by dividing it across multiple GPUs

Shared Data Memory
Groups of processor cores can collaborate using shared data

High Speed, PCI-Express Data Transfer
Fast and high-bandwidth communication between CPU and GPU

Submitted by Mr Ian Martin Ajzenszmidt