9/04/2009

Password recovery : Theoretical and practical results

And finally a summary table for the sake of which everything was started. Comparison rates for different types of passwords. Were used:

Speed was measured on a single core Intel Core 2 Q6600@2.4Ggts, ATI HD4850 at the nominal frequency 625Mgts, nVidia GTX 260 / w 192SP also at the nominal frequency 1.242Ggts.

Algorithm

Number of blocks

Required cycles

Theoretical speed

Practical speed

ATI HD4850

x

GTX 260

x

RAR 3.x

passlen = 4

(4 * 2 + 11) * 4096 + 17 = 77,841 x SHA1

13 622 175

176

160 (ARCHPR)

168 (crark)

3120

19.5

2080

12.4

RAR 3.x

passlen = 6

(6 * 2 + 11) * 4096 + 17 = 94,225 x SHA1

16 489 375

145

134 (ARCHPR)

2625

19.5

1695

12.6

WinZip AES

(1000 + 1) * 2 = 2002 x SHA1

350350

6850

6700

135K

20.1

--

--

WPA

(4096 + 1) * 2 * 2 = 16388 x SHA1

2867900

836

820

14К

17

9800

12

MS Office 2007

50005 x SHA1

8750875

274

94

--

--

3300

37.5

PDF9

1 x SHA256

--

--

5.09M

--

--

74.4M

14.6

MD5 single hash

1 x MD5 * (45/64)

51

47M

44.5M

891M

20

563M

12.7

As seen, the ratio of velocity to the GPU to speed on the CPU rather monotonous. ATI HD4850 is about 20 times faster than a single core Q6600 at 2.4GHz frequency, GTX 260 about 12 times. The value of 37.5 for Office 2007 came only because of the non-optimized CPU code (funny that Office 2007 uses SSE2, but so badly that it is better to not use). Speed sorting on the GPU depends only on the frequency of the ALU (as with all the CPU), memory speed and size mean nothing. Knowing the difference in the frequency and number of SP can be fairly accurately assess the performance of different GPU one family.

Compare directly the price the CPU and GPU rather silly - the video card can not work independently, and the CPU too. Systems with two or more processors immediately take us to another level of prices, while there is no economic sense to consider these (server) processors large L2 cache for tasks passwords attack. With GPU is the limiting factor is the number of PCI-E connectors on the motherboard. At the simplest of all, he boards one at most - two. Motherboard with 4 connectors is quite rare. And one must not only 4 slots, but also a place for themselves maps - take something they are usually 2 slot. 4 times 2 slot is found, apparently, only the MSI K9A2 Platinum.There are 6-slot P6T6 WS Revolution, but there can only put a «single» maps.

Seems put together a system with 4 ATI HD4870x2 is impossible in principle (a good description of all the problems here: http://forums.guru3d.com/showthread.php?p=2862271). System with 4 dual nVidia cards, there are ( examplehttp://fastra.ua.ac.be/en/specs.html), but the problems and there enough. Starting with a powerful power supply and case, where everything can be properly placed and finishing with installing drivers.

The overall result is a list of benchmark performance at 4-cores Q6600 at 2.4GHz Frequency:

Title

Frequency, number of kernels

Value

Price CPU or GPU

Total price for the system

E2160

1.8 x 2

0.38x

?

<$ 300

Q6600

2.4 x 4

1x

$ 220

~ $ 600

nVidia 8600 GT

1.188 x 32

0.5x

?

~ $ 650

nVidia 9800 GT

1.242 x 112

1.75x

$ 120

~ $ 720

ATI HD4770

0.75 x 640

5x

$ 100

~ $ 700

ATI HD4850

0.625 x 800

5x

$ 120

~ $ 720

nVidia GTX295

1.242 x 240 x 2

7.5x

500

~ $ 1200

ATI HD4870x2

0.75 x 800 x 2

12x

$ 400

~ $ 1100

4x ATI HD4770

0.75 x 640 x 4

20x

$ 400

~ $ 1200

2x ATI HD4870x2

0.75 x 800 x 2 x 2

24x

$ 800

~ $ 1600

4x nVidia GTX295

1.242 x 240 x 2 x 4

30x

$ 2000

> $ 3000

TACC1441 FPGA

?

2.4x

$ 4000

~ $ 4500

For systems with slow GPU (like 8600 GT) it makes sense to consider performance as a CPU + GPU (that there has not been done). For fast same GPU processor will be loaded with additional calculation and synchronization tasks, so that is additional to it simply makes no sense. In some cases, the quad-core processor may not be sufficient for systems with 8 GPU.

As can be seen, the most interesting solution in terms of price / performance is a bunch of 4-HD4770. Once again, however, requires a motherboard which can accommodate 4 such cards - despite the fact that the HD4770 chip card, the cooling system has an additional slot.

TACC1441 shown for example as an «iron» solutions. This fee is based on FPGA, supported by software from AccessData (http://www.forensic-computers.com/TACC1441.php). As you can see, there was no competition with modern GPU can not make it.

Was left a «little» problem - the almost complete absence of software for the cards ATI. Cards from nVidia are supported in ElcomSoft's Distributed Password Recovery, but with ATI until all very bad.

Conclusion: The increase in productivity from the use of a single GPU approximately 4-5-fold compared with the modern quad system. From ATI noticeably faster at the same price, but there are almost no software. With increasing number of GPU in the system of nonlinearly growing problem - there is a need for a powerful power supply, a special case, efficient cooling system, etc.


[Link]

No comments:

Post a Comment