Showing posts with label GPU. Show all posts
Showing posts with label GPU. Show all posts

5/03/2010

AMD GPU's new instruction - BitAlign

4/20/2010

Results for SHA1 & MD5 on HD5870 and new version of ighashgpu

AMD HD 5870 - SHA1 : 1350M/sec
AMD HD 5870 - MD5 : 3185M/sec




translate.googleusercontent.com/translate_c?hl=ru&ie=UTF-8&sl=ru&tl=en&u=http://www.golubev.com/blog/%3Fp%3D20&rurl=translate.google.com&usg=ALkJrhhhUMCtO7Sn4NE0kFVTz-QZvCYxzA


I was curious enough to test performance for SHA-1 today. As I expected bitalign usage even more noticeable for SHA-1 than for MD5. Theoretically speed-up can be as large as 50%, however as always there are some details.
At first, my SHA-1 wasn't good enough at ighashgpu v0.62. By slightly changing algorithm I've got 15% better results. Then I've added bitalign - another 40% for HD 5XXX and finally I've removed last 4 rounds from SHA-1 ("reversed" in other words). Last optimization was already done earlier for CUDA code, now I've just applied it to ATI code. It's another 5%. So, all in all, performance for single SHA-1 hashes at HD 5XXX now 71% better than it was.Impressive, isn't it? :) .
As I Feel Lazy to Test all These major changes and big speed-ups I've decided Ongoing to Release Intermediate version of ighashgpu (Call it alpha?), You CAN Download it here . Not all kernels changed, basically only single MD5 and all SHA-1 related ones updated. By the way, ATI version can now supports passwords (+ optional salt) up to 48 symbols long (== Joomla). nVidia code wasn't updated for this.
And one more thing about nVidia CUDA code, I've changed a bit the way passwords distributed among threads / blocks. As a result, there is small speed-up, like 2-3% for all CUDA kernels. It doesn't looks like huge thing but when utilization already over 95% these 2-3% are very nice actually.
Also, I've finally fixed / sf + / m usage bug (I hope so, at least).
So, I'm interesting in some feedback, especially results on 5970's and GTX285/295.

1/03/2010

FASTRA2

http://fastra2.ua.ac.be


FASTRA II: the world’s most powerful desktop supercomputer

FASTRA
Is it possible to fit the computing power of a large supercomputer cluster in the tight space of a PC case? In our research on image reconstruction we often have to perform large-scale scientific computations, which can easily take weeks on a normal PC. Last year, the FASTRA project was launched to develop a desktop supercomputer based on gaming hardware. Although highly successful, even FASTRA cannot provide the computational power required for our latest research projects. FASTRA needs a successor, which should be much more powerful, while maintaining the favorable properties of its older brother: green, mobile and inexpensive. For just 6000 euros, you can have 12TFLOPS of computing power at your fingertips.
Part of the Vision Lab of the University of Antwerp, the research group ASTRA focuses on the development of new computational methods for tomography. Tomography is a technique used in medical scanners to create three-dimensional images of the internal organs of patients, based on a large number of X-ray photos that are acquired over a range of angles. ASTRA develops new reconstruction techniques that lead to better reconstruction quality than classical methods.
One of the applications is 3D imaging of bone tissue in mice, which is commonly required in medicine research for osteoporosis. The structures of interest are at the resolution limits of current micro-CT scanners. We are working on advanced computational methods that allow for the computation of higher resolution images based on the same scanner data. The downside: computation time, which was already a major issue, increases even further.

Fortunately, these computations can be carried out in parallel on graphics hardware, much faster than when using normal CPUs. Graphical Processing Units (GPUs) are becoming more and more common now for all kinds of scientific computing. For suitable applications, a single GPU already has the computing power equivalent to a moderate CPU cluster. In collaboration with Tones.be and ASUS, We have now developed a PC design that incorporates 13 GPUs, resulting in a massive 12TFLOPS of computing power.
Although the system is up and running, we are still experiencing software stability issues, probably caused by an incompatibility between the video drivers and the BIOS and Linux modifications we had to use. Check out the blog for more details on the current status of FASTRA II.
The FASTRA II design contains six NVIDIA GTX295 dual-GPU cards, and one GTX275 single-GPU card. To fit all this hardware in a single PC case, a special cage was designed for the graphics cards, which are connected to the motherboard by flexible riser cables. To satisfy all 13 GPUs, The system has four power supplies. At full speed, it can outperform a moderately sized cluster of state-of-the-art CPUs. And guess what… this system costs less than 6000 euros!

Specifications

Hardware overview
Hardware assembly
Software overview

Hardware overview

Case: Lian-Li PC-P80 Armorsuit

Lian-Li PC-P80 Armorsuit
The PCP80 case, which was also used in the FASTRA I build, provides a massive amount of working space and offers 9 expansion slots at the back of the case. Although the graphics cards in FASTRA II do not directly fit into these slots, it provides big ventilation gaps for releasing exhaust heat from the cards. The case had to be modded slightly for this project, by drilling holes for attachment of the GPU rack screws.

Motherboard: ASUS P6T7 WS Supercomputer

ASUS P6T7 WS Supercomputer
The Asus P6T7 motherboard is the only workstation motherboard available that has seven full-size PCI Express slots. The X58 chipset is connected to two additional NForce 200 chips that distribute PCI Express bandwidth between the seven slots.

CPU: Intel Core i7 920

Intel Core i7 920

Managing 13 GPUs simultaneously requires heavy multithreading on the CPU side, requiring a multicore CPU. As nearly all computational load is shifted to the GPUs, we opted for the CoreI7 920, which is highly affordable, while allowing for future upgrade possibilities.

Memory: 6×2GB Corsair DDR3 1333

6x 2GB Corsair DDR3 1333MHz

Having as much RAM as possible is crucial for our type of applications. 12GB is usually sufficient to load large 3D volumes (e.g. 1024×1024x1024) completely in memory. We would have loved more memory though. Compared to a “real” supercomputer we have only a tiny amount of memory at our disposal. The Corsair memory has decent timings, at an affordable price point. Remarkably, the total amount of GPU memory in the FASTRA II system is about the same as the total amount of system RAM. It is not strictly necessary that all GPU memory is backed up by an equal amount of system RAM, though.

Harddisk: Samsung Spinpoint F3 1TB

Samsung Spinpoint F3 1TB

At first this choice may seem somewhat strange. The Spinpoint harddrive is not very fast compared to more expensive models such as the WD Raptors or Solid State Disks. However, we observed that in our case, disk access is not a performance bottleneck at all. In particular, having a single harddisk improves the airflow through the case and keeps the system very tidy.

Power Supply: Thermaltake Toughpower 1500W + 3x Thermaltake PowerExpress 450W

Thermaltake Toughpower 1500W Modular PSUThe Thermaltake Toughpower already proved itself in the FASTRA I design. It has four PCI Express x6 and four PCI Express x8 connectors and powers four of the GTX295 cards. However, this power supply cannot power all graphics cards simultaneously. As the bottom of the case is already occupied, we decided to use the special VGA power supply offered by Thermaltake, which fits into a drivebay. Each PSU has connectors to power two graphics cards, but we use only one. This PSU takes one drive bay, as opposed to its bigger 650W brother, which takes two drive bays.

Graphics Cards: ASUS ENGTX275 + 4x ASUS ENGTX295 (2PCB) + 2x ASUS ENGTX295 (1PCB)

Asus NVIDIA GTX295

As the system has been assembled over a rather long period of time, two types of GTX295 cards have been used. We started with a series of dual-PCB cards (part no. 90-C3CGX0-K0UAY00T). The more recent single-PCB cards (part no. 90-C3CGX5-K0UAY00T) generate less heat, which can also be obeserved clearly in ourheat camera images. For the single-GPU card, which is connected to the screen, we opted for the GTX275, which is newer than the GTX285 and almost as powerful. This card has to be a single-GPU card for technical reasons, restricting us to a 13-GPU system (and not a 14-GPU one).

Flexible PCI Express risers: Adex Electronics PE-FLEX16 gen. 2 risers

ASUS P6T7 WS Supercomputer
The flexible risers from Adex Electronics can be ordered in different lengths, and allow for sufficient flexibility to connect all seven dual-slot graphics cards to the tightly spaced PCI Express slots of the motherboard.

GPU suspension cage: Custom Design

ASUS P6T7 WS Supercomputer
A strong cage is required to keep all cards in place above the motherboard. In collaboration with Tones.beand the firm LASERTEK N.V., a cage was designed and manufactured out of aluminium that meets the requirements.

Hardware Assembly

Tones.be

The Belgian computer shop Tones.be provided assistance and support during this project, and performed the assembly of FASTRA II. They managed to deliver a very clean build, despite the vast number of power and riser cables involved.

Software overview

Operating System: Linux, CentOS 5.3

Centos
We selected CentOS because it provides a stable environment that doesn’t need much maintenance. Instead of the standard CentOS Linux kernel, we used a custom 2.6.29.1 kernel.

Tomography Code: C++ and MATLAB 2009b

C++
We use portable C++ for the core functionality of our software. In Windows, we useMicrosoft Visual Studio 2005, and on Linux, the C++ code can be compiled using the GNU C++ compiler. We’ve also developed a front-end for MATLAB. MATLAB has an easy to use interface and thus allows rapid prototyping of new algorithms. All GPU code is developed using the NVIDIA CUDA framework, a C-like programming language that allows for efficient programming of the NVIDIA GPUs.

11/18/2009

Multi-GPU Radeon HD 5000 series video card "Radeon HD 5970"

AMD is November 18, Radeon HD 5000 series GPU Multi-core video card is "Radeon HD 5970" announced. Radeon check the performance of the new flagship product. 

● Equipped with two full-spec based on the Cypress, the clock specification 5850


Hemlock had been predicted by the code name "Radeon HD 5970" is, Radeon HD 5000 series architecture based on multi-GPU is the video card. Radeon HD 3000/4000 generation, product name at the end of "X2" multi-GPU and it was put to clarify that the video card, this generation has shown that the Radeon HD 5800 series lineup top The model name is adopted.

AMD will use here is borrowed from the reference board, the board length is very long (photo 1,2). AMD multi-GPU on one PCB video card with two GPU put the style (Photo 3) and sticking to the board tend to be really long.




p1.jpg
[Photo 1] Radeon HD 5970 reference board


ph02.jpg
[Photo 2] long board is about to end, including the design of 310mm. Radeon HD 5870 more than 3cm long and is

ph03.jpg

[Photo 3] The back of the reference board. GPU single PCB to the two shows that are listed


The main specifications are as shown in Table 1. Once a brief, 20 SIMD units have enabled all of the Radeon HD 5870 for full-spec engine, the same group with two core Cypress, the clock will be called Radeon HD 5850 running at an equivalent product. AMD demonstrated its architecture in Figure 1 is a description of materials. PLX Techology Cypress Core 2 based chip will be connected by a bridge style. This is like the Radeon HD 4870 X2/3870 X2.


Table 1 Radeon HD 5970 Specifications


Radeon HD 5970Radeon HD 5870Radeon HD 5850
Process rules40nm 
Core clock725MHz850MHz725MHz
SP Number1,600 units × 21,600 units1,440 units
Texture Units80 units × 280 units72 units
Memory1GB × 2 GDDR51GB GDDR5 
Overclocking1,000 MHz1,200 MHz1,000 MHz
Memory Interface256bit × 2256bi t 
ROP Units32 units × 232 units 
Power Board (idle)42W27W 
Power Board (peak)294W188W151W



ph03_1.jpg

[Figure 1] Block diagram of the Radeon HD 5970


Incidentally, the reference board try to remove the cooler, GPU, and each of the two units can be sure that 1GB of GDDR5 memory and being (Photos 4-7). H5GQ1H24AFR-T2C on Hynix's memory, 5.0Gbps memory chip that supports up. 4.0Gbps say that there is enough for chips.


ph03_2.jpg
[Photo 4] after removing the air conditioner. Cypress both sides of the core GPU, the central bridge chips
ph03_3.jpg
 [Photo 5] each GPU connected to the memory of 8 total. Each per GPU, the surface of four sheets, the back four sheets to implement
 ph06.jpg
[Photo 6] The video memory is Hynix H5GQ1H24AFR-T2C. 1Gbit chips each GPU to eight pieces each, and a total of 2GB
 ph06_1.jpg

[Photo 7] Vapor Camber cooler is that adopted the thermal conductivity (transport) which increases


And have adopted such a memory, Radeon HD 5970 is a point that is also appealing design that is over with the clock (Figure 2-3). Power consumption, but later, 300W while running clock set to exceed the current frame that leaves room for the tune score, suggesting that the idea.


ph06_2.jpg
[Figure 2] Radeon HD 5970 is designed to appeal to the board with overclocking


ph06_3.jpg

[Figure 3] which also shows the design with the clock over here, in terms of features and structures, including a cooler



Bridge chip, ATI logo and "AMD8647-BBB50BC" was stamped the model number (photo 8). However, as was also the first document, the entity would PLX Technology products. AMD8647 and from the model show that, Radeon HD 4870 X2 like 48-lane PCI Express 2.0 bridge chip PEX8647 is believed to have been used.

294W peak power is in the way of the board, power supply terminal is equipped with 6-pin and 8-pin (Photo 9). PCI Express slots, +8 pin 6 can be supplied in-line configuration on the edge, and 300W power supply pin, the clock speed of the Radeon HD 5850 that is considerable and seems to be greater for this reason. The PEX8647 bridge chip, the product literature, 2.8W typical power consumption has been marked, Radeon HD 5970 from my total, and large enough to affect the power is not major.

Bracket of the Mini DisplayPort and DVI × 2 configuration is (photo 10). Eyefinity Radeon HD 5000 series features characteristic of these three can be used simultaneously to one output, Mini DipslayPort the availability of good points from the DisplayPort adapter to convert it disturbing. Most, Mini DisplayPort also finally standardized as the VESA standard so extended, the potential for improved high availability of these adapters.
 

ph06_4.jpg
[8] with two photos and GPU connection, PC with PCI Express bridge chip provides the interface "AMD8647-BBB50BC" use. PLX Technology is a chip made


ph09.jpg
[Photo 9] 6-pin power supply terminal pin configuration +8


 ph09_1.jpg

[10] pictures of the bracket DVI-I × 2 + Mini DisplayPort is and




In addition, Native CrossFire and pins, this product can be used two pieces Quad CrossFire (Photo 11). Rated as the clock is running when the idle core by ATI PowerPlay 157MHz, Memory 300MHz to ensure that the clock is down to (screen 1)
ph09_2.jpgph09_3.jpg

9/13/2009

HD 5870 with 1024 and 2048 MiB VRAM


The day before yesterday we revealed the specifications of the HD 5870. Those also contained 1600 stream processors, 80 TMUs and 32 ROPs, and with that, definitely twice of the HD 4850/4870/4890. This also leaves a mark on the transistor budget of the chips, which is now twice as high with 2.15 Billion circuits. Despite the 40 nm technique the chip size amounts 338 mm², so the chip is the second-biggest AMD chip after the R600.
.HD 4870HD 4890HD 5850HD 5870
Technique55 nanometres55 nanometres40 nanometres40 nanometres
Transistors0,96 Billion0,96 Billion2,15 Billion2,15 Billion
Chip size260 mm²282 mm²338 mm²338 mm²
Chip frequency750 MHz850 MHz725 MHz850 MHz
Memory frequency900 MHz975 MHz1000 MHz1200 MHz
Stream processors80080014401600
Shader performance1200 Gigaflops1360 Gigaflops2088 Gigaflops2720 Gigaflops
TMUs40407280
Texel fill rate30.000 MT/s34.000 MT/s52.200 MT/s68.000 MT/s
ROPs16163232
Pixel fill rate12.000 MP/s13.600 MP/s23.200 MP/s27.200 MP/s
Memory interface256 Bit GDDR5256 Bit GDDR5256 Bit GDDR5256 Bit GDDR5
Memory bandwidth115,2 GB/s124,8 GB/s128,0 GB/s153,6 GB/s
Prices--$299$399 (1024M)
$449 (2048M)

As we could learn now, the HD 5870 will appear in several versions, one with 1024 and another with 2048 MiB texture memory, although it is still uncertain through the loss of official benchmarks if the bigger memory adds up at all. The doubled memory will result in 50 USD more in price. This means for the HD 5870 with 1024 MiB texture memory still to be at 399 USD, while you just find a < 400 USD on AMD's newest documents, so 449 USD for the 2048 MiB version.
ATI Radeon HD 5870 (4 Bilder)
ATI Radeon HD 5870
The interesting thing is, that, against our last report, the HD 5870 with 1024 MiB will not work with 825/1300 MHz, but with 850/1200 MHz. If this follows a small last-minute alternation or if the 825/1300 MHz are meant to be from the 2048 MiB version, we will not know until the launch day, which will still be September 23. The performance of the HD 5870 should not experience any major changes.

9/10/2009

Radeon HD 5870 Spec!!

The Radeon HD 5870 has 1600 shaders and runs at 825MHz. GDDR5 memory runs at 1050MHz compared to 1300MHz on the 5870 card and this results in a memory bandwidth of 140GB/s, which is more than the HD 4870. Updated
As you can imagine, ATI's Radeon HD 5850 card is a slower and crippled version of the Radeon HD 5870 and this slower card has 1440 shaders and runs at 725MHz.
Just like the Radeon HD 5870, this card comes with 32ROPs and the maximum power consumption is 170W under load and 27W when idle. The launch date is September 23rd.

http://www.fudzilla.com/content/view/15433/1/

9/04/2009

System with 4 dual nVidia cards

Specifications

Hardware overview
Software overview

Hardware overview

Case: Lian-Li PC-P80 Armorsuit

Lian-Li PC-P80 Armorsuit

An essential requirement for the case is that it must have at least eight expansion slots, two for each dual-slot graphics card. Nearly all PC cases have at most seven expansion slots, leaving the Lian-Li PC-P80 and Thermaltake Armor as the only options we could find. Both are excellent choices: we opted for the Lian-Li case mainly for availability reasons.

Motherboard: MSI K9A2 Platinum

MSI K9A2 Platinum

We searched for a motherboard that has four physical PCI-Expressx16 slots and has double spacing between each of them. There are several boards in the market with four graphics card slots, but we could find only two that have the required double slot spacing: the MSI Quad Royal, which is now outdated and hardly available anymore, and the MSI K9A2 Platinum. We originally intended to choose a platform suitable for an Intel processor, either Skulltrail or NVIDIA 790i, but none of the available motherboards satisfied the slot requirements. Ultimately, we decided to go for an AMD chipset, as CPU performance is not a major concern in our case anyway. Note that the K9A2 motherboard has a very modest price tag and lacks several features found on more expensive motherboards.

CPU: AMD Phenom 9850

AMD Phenom 9850

A quad-core CPU is essential in our system, as there is continuous communication going on between the CPU and GPUs. Spreading these tasks among several processor cores reduces latencies and improves the overall computation speed. We opted for the new Phenom 9850 CPU, which does not have the TLB bug that originally plagued AMD's quad-core product line.

CPU cooler: Scythe infinity

Scythe infinity

This CPU cooler provides more than enough cooling for the CPU. Since we are not overclocking the CPU, any decent cooler would have done the job.

Memory: 4x 2GB Corsair TWINX DDR2 PC6400

4x 2GB Corsair TWINX DDR2 PC6400

Having as much RAM as possible is crucial for our type of applications. 8GB is usually sufficient to load large 3D volumes (e.g. 1024x1024x1024) completely in memory. We would have loved more memory though. Compared to a “real” supercomputer we have only a tiny amount of memory at our disposal. The Corsair memory has decent timings, at an affordable price point.

Harddisk: Samsung Spinpoint F1 750GB

Samsung Spinpoint F1 750GB

At first this choice may seem somewhat strange. The Spinpoint harddrive is not very fast compared to more expensive models such as the WD Raptors. However, we observed that in our case, disk access is not a performance bottleneck at all. In particular, having a single harddisk improves the airflow through the case and keeps the system very tidy.

Power Supply: Thermaltake Toughpower 1500W Modular PSU

Thermaltake Toughpower 1500W Modular PSU

The Thermaltake Toughpower was the only PSU we could find that has the necessary four PCI-Express x6 and four PCI-Express x8 connectors. An alternative would have been to go for several smaller PSUs, but this modular PSU keeps the system extremely tidy. It turns out to be more than capable of powering four dual graphics cards simultaneously, even after overclocking the shader cores on all eight GPUs by 20%.

Graphics Cards: 4x MSI 9800GX2

MSI 9800GX2

There is no dominating reason to choose MSI over competing brands for the graphics cards. We obtained a good offer from MSI for these graphics cards and decided to take it. In particular, this seemed to be a sensible choice in case we needed support, as we are also using an MSI motherboard.

Software overview

We selected Windows XP-64 as the operating system for FASTRA. There were three reasons for choosing this platform: first, we needed a 64-bit operating system, in order to utilize 8GB of RAM. Second, we expected fewer driver issues on Windows compared to Linux. Third, within the Windows product line, Windows Vista is not yet supported by the NVIDIA GPU Computing platform, leaving Windows XP as the only choice. For development, we use Microsoft Visual Studio 2008. The core functionality for our CPU code is written in C++ (Visual C++), while MATLAB is often used as a front-end for rapid prototyping. All GPU code is developed using the NVIDIA CUDA framework, a C-like programming language that allows for efficient programming of the NVIDIA GPUs.

http://fastra.ua.ac.be/en/specs.html


4 HD4870X2!! why impossible?

I registered to this board to hopefully make some clarifications from all of what I've read above related to Four 4870X2 video cards in a single system. Please feel free to debate the issue further if you feel that anything I say is wrong. I'm not after being right, only the truth, I've spent a great deal of time addressing this issue.

1. The chipset does not determine a motherboard's ability to support Four 4870X2 GPUs (or a total of 8 GPUs). The driver does. Of course the motherboard needs to have a minimum of Four PCIe Gen 2 x16 lanes.

2. NVIDIA did say that they tested a system with 8 GPUs (Four 4870X2s), however: "As an aside, AMD has already built a computer that has four 4870X2s in it. So it has eight GPUs; drivers will not be supporting eight GPUs at this point of time," Mr. Hook said.
http://www.xbitlabs.com/news/video/d...ics_Cards.html

3. There are MANY extremely good reasons for a system to be built with Four 4870X2s. Any type of application that follows a SIMD (Single Instruction Multiple Data) design can hugely benefit from the massive parallel processing capacities such a system could provide, even despite bandwidth concurrency and latency issues that would unfold as the PCIe Gen 2.0 x16 slots auto negotiated down to a bandwidth of 4 x 4 x 4 x 4. For instance I play a role in the security community and my specialty is cryptanalysis. Such a machine could have many benefits in my field of research. The programming technique used to unlock the potential of the GPUs is called stream programming or GPGPU (General Purpose Computations on GPUs). Oh, and did I mention such a system would have a potential of 9.6 TFLOPs? (Super computer status, indeed).

4. As of this day, due to driver support among other bandwidth PCIe related issues there is no way such a system would have any advantage as a "Gaming System" (aside of course from the Two out of the Four 4870X2s that you COULD use in a CrossFireX configuration). Also, the 4870X2 cards use double wide spacing so one needs to do one of two things. Either build/buy a case that can support more than 7 extension cards (the standard) OR remove the default fan/heatsink cooling system from the video cards and install a water cooling solution which would allow for far less room being occupied.

5. There are indeed MOTHERBOARDS that support such a system. One is the MSI K9A2 Platinum. There is another made by MSI that also works (AND READ CLOSELY TO THIS PART) ... the motherboards work as far as detecting the devices within the PCIe slots, assining them they're appropriate device addresses, and auto negotiating the 4 PCIe lanes down (for this is a non operating system/driver level function). Now, as of Catalyst 8.9 (the most recent available Radeon driver) 8 GPUs (Four 4870X2s) do not seem to be supported. Now you'll notice I just said "seem". Perhaps that was an incorrect choice of words for the most compelling evidence I have to present to this point is this:



The above is ONE of my system. I have tried two different motherboards, and while I've found there is some tricky programming I can do to access each card at an extremely low level, the driver from ATI most certainly does not support this configuration. Also, I've tried every available operating system ATI supports, many it does not, and I've even spent a great deal of time dissasembling the driver in order to find a "hacked" solution... I have had no luck yet. I did contact MSI and they verified that theoretically Four 4870X2s is possible. They agree also that it is a matter of ATIs driver support. In addition, the FASTRA project which uses 8 GPUs of the NVIDIA variety uses that exact same motherboard for there stream processing super computer that they use for a field of study called Tomography. NVIDIA's drivers DO seem to support such a configuration with there CUDA development platform.

Last I would like to say that I very sincerely hope that I'm incorrect for regardless of all that I've said I'm still looking for a possible solution. In addition, the (I can't remember if I said this already): there is absolutely no guarantee that intel's X58 chipset will support Four 4870X2s (8 GPUs) but there is certainly also no reason that it would not, just as there is no reason that they theoretically couldn't work on present day motherboards with the mentioned requirements.

Anyway, I didn't mean to be abrupt or "bash" anyone and I apologize if I came off that way. I'm simply regurgitating what I have both experienced and learned to this point.

Regards.

http://forums.guru3d.com/showthread.php?p=2862271

Creating A5/1 Rainbow Tables

The Time/Memory Tradeoff framework is still in its early stage of development. It has been started in early 2009 and now reached basic architectural stability.

The attack on the A5/1 is a reimplementation of the attack by THC, which was done in early 2008. Our approach differs slightly, as we use more common hardware to generate the tables, namely graphics cards with GPGPU capability and attempt to build a distributed infrastructure of nodes where each node donates both a small portion of diskspace for a part of the table and some kind of fast hardware for the generation of and lookup in its own table. We also took this project as a motivation to design and code a general purpose TMTO library. The attack itself is still the same and we owe THC much for their pioneering work. Also take a look at http://airprobe.org for information and software on the sniffing of GSM data.

[Link]

8/27/2009

VRIDGE X100 - PCI Express Expander


1 PCI Express Host Card - 4 PCI Express slot board
Cost - $5,000

1. CASE

2. Host Card, Expander Board

3. Detail






chip-set Lucid HYDRA 100
external connecter PCI-Express x16 Connecter
power connecter ATX 24pin power connecter
ATX 8pin power connecter
power 4W (only chip)
bus connecter PCI-Express x16 slot
PCI-Express slot 4
OS Windows XP (32bit / 64bit)
Windows Vista (32bit / 64bit)
Linux
size 223.8mm×264.2mm
size

external connecter PCI-Express x16 connecter
bus PCI-Express x16
size 131mm×69mm×14mm
size


http://www.elsa-jp.co.jp/english/products/pes/vridge_x100_dual16/index.html