When Will We See Collisions for SHA-1?

On a NIST-sponsored hash function mailing list, Jesse Walker (from Intel; also a member of the Skein team) did some back-of-the-envelope calculations to estimate how long it will be before we see a practical collision attack against SHA-1. I’m reprinting his analysis here, so it reaches a broader audience.

According to E-BASH, the cost of one block of a SHA-1 operation on already deployed commodity microprocessors is about 2¹⁴ cycles. If Stevens’ attack of 2⁶⁰ SHA-1 operations serves as the baseline, then finding a collision costs about 2¹⁴ * 2⁶⁰ ~ 2⁷⁴ cycles.

A core today provides about 2³¹ cycles/sec; the state of the art is 8 = 2³ cores per processor for a total of 2³ * 2³¹ = 2³⁴ cycles/sec. A server typically has 4 processors, increasing the total to 2² * 2³⁴ = 2³⁶ cycles/sec. Since there are about 2²⁵ sec/year, this means one server delivers about 2²⁵ * 2³⁶ = 2⁶¹ cycles per year, which we can call a “server year.”

There is ample evidence that Moore’s law will continue through the mid 2020s. Hence the number of doublings in processor power we can expect between now and 2021 is:

3/1.5 = 2 times by 2015 (3 = 2015 – 2012)

6/1.5 = 4 times by 2018 (6 = 2018 – 2012)

9/1.5 = 6 times by 2021 (9 = 2021 – 2012)

So a commodity server year should be about:

2⁶¹ cycles/year in 2012

2² * 2⁶¹ = 2⁶³ cycles/year by 2015

2⁴ * 2⁶¹ = 2⁶⁵ cycles/year by 2018

2⁶ * 2⁶¹ = 2⁶⁷ cycles/year by 2021

Therefore, on commodity hardware, Stevens’ attack should cost approximately:

2⁷⁴ / 2⁶¹ = 2¹³ server years in 2012

2⁷⁴ / 2⁶³ = 2¹¹ server years by 2015

2⁷⁴ / 2⁶⁵ = 2⁹ server years by 2018

2⁷⁴ / 2⁶⁷ = 2⁷ server years by 2021

Today Amazon rents compute time on commodity servers for about $0.04 / hour ~ $350 /year. Assume compute rental fees remain fixed while server capacity keeps pace with Moore’s law. Then, since log₂(350) ~ 8.4 the cost of the attack will be approximately:

2¹³ * 2^8.4 = 2^21.4 ~ $2.77M in 2012

2¹¹ * 2^8.4 = 2^19.4 ~ $700K by 2015

2⁹ * 2^8.4 = 2^17.4 ~ $173K by 2018

2⁷ * 2^8.4 = 2^15.4 ~ $43K by 2021

A collision attack is therefore well within the range of what an organized crime syndicate can practically budget by 2018, and a university research project by 2021.

Since this argument only takes into account commodity hardware and not instruction set improvements (e.g., ARM 8 specifies a SHA-1 instruction), other commodity computing devices with even greater processing power (e.g., GPUs), and custom hardware, the need to transition from SHA-1 for collision resistance functions is probably more urgent than this back-of-the-envelope analysis suggests.

Any increase in the number of cores per CPU, or the number of CPUs per server, also affects these calculations. Also, any improvements in cryptanalysis will further reduce the complexity of this attack.

The point is that we in the community need to start the migration away from SHA-1 and to SHA-2/SHA-3 now.

Tags: cryptanalysis, cryptography, hashes, NIST, SHA-1

Posted on October 5, 2012 at 1:24 PM • 62 Comments

Comments

k • October 5, 2012 2:21 PM

To find the collision, don’t you have to perform 2^60 20 byte compares? How much does the Amazon storage to hold all that data cost, and doesn’t the disk bandwidth affect the performance?

Kevin Bowling • October 5, 2012 2:28 PM

The analysis seems reasonable aside from the Amazon tie in. The commodity server that Amazon rents for that price is an order of magnitude different than the commodity server specified, likely offsetting the cost by an order of magnitude.

But for one final twist, a criminal syndicate would simply use a botnet with vastly different economics than a commercial compute service.

Tomasz Wegrzanowski • October 5, 2012 2:29 PM

“Assume compute rental fees remain fixed while server capacity keeps pace with Moore’s law.”

Anybody who believes this assumption has zero experience with Amazon pricing model.

Amazon prices fall very slowly, much slower than Moore’s law, and they increase very steeply for better hardware.

More realistic attack would be a few grad students and a lot of GPUs.

Ilya Albrekht • October 5, 2012 2:32 PM

Am I correctly understand that 2^14 is per 64 bytes block? I believe this might be over-pessimistic.

According to OpenSSL 1.0.1 source codes, SHA-1 (AVX) is 4.6 cycles/byte on the latest publicly available Ivy Bridge CPU, or ~2^8 clocks per block. So presented estimation might be way worse.

But the question is – does it make sense to switch to SHA-2+ and spend 2-3x more time and power consumption on hashing for things like gmail, skype, online gaming and so on. I believe such information usually doesn’t worth $50K, and if it does people usually understand it and use more secure channels.

Regards,
Ilya

Matt • October 5, 2012 2:35 PM

That must have been a huge napkin!

bcs • October 5, 2012 2:36 PM

It might have been posed here before but; there is a similar type of analysis for AES that might be extrapolatable to this.

http://eprint.iacr.org/2011/710.pdf

bcs • October 5, 2012 3:00 PM

@Ilya Albrekht

log2($64k/$1k) = 9 years

Will the thing protected be worth $1k a decade from now?

meir • October 5, 2012 3:17 PM

The main issue is future cartographic advances. I can assume my data is worth say $20k and conclude SHA-1 is good enough for the next few years. But when will the next cryptanalytic improvement be? A 1000 fold improvement could pop up tomorrow morning (or in a couple of years)
but it is very likely the best attack won’t remain the same. and that is what is difficult to predict. SHA-1 gives us no safety margin.

wumpus • October 5, 2012 4:06 PM

If you want to see what a system to efficiently compute SHA(256), you should look to bitcoin farmers. From what I understand, they used AMD5830s (now 2 generations out of date) if at all available.

According to this site:https://en.bitcoin.it/wiki/Mining_hardware_comparison

~$300 AMD 7950 boards pull 500 hashes/s.
Top of the line Intel CPUs pull no more than 66 hashes/s (the odd behavior due to compiler switches does not inspire confidence either. Don’t count on more than 40 hashs/s even with 6 cores active).
Nvidia does not appear to have bothered adding the missing rotate(?) instruction that rumor claims is killing their performance. Expect significantly better than a CPU, but less than 1/4 that of an equally priced AMD board.

I can’t argue that SHA256 performance will match SHA-1, but it would be a good place to start. I also can’t believe that there aren’t real bitcoin fans following this site.

Randall • October 5, 2012 4:52 PM

Yup @wumpus. Crackers get huge speedups that everyday users don’t, like computing several hashes in parallel on a GPU or special hardware.

Then there’s the potential for new cryptanalysis. And it’ll take years to migrate a chunk of the world’s computers, embedded systems(!), and hardware(!) to a new algorithm.

If we don’t at least start making easy changes now, SHA1 will just follow MD5’s trajectory many years delayed.

(I want to see randomized hashing used wherever possible, too, so we can all worry about collisions much less, but that’s another story.)

BJ • October 5, 2012 6:49 PM

@wumpus,
Another bitcoin fan here…

SHA1 is 160bits, and is simpler than SHA2 (SHA256).
see http://en.wikipedia.org/wiki/SHA-1#Comparison_of_SHA_functions

Since it uses a subset of the operations of SHA2, it should be able to run on AMD GPUs even faster than SHA256.

BJ • October 5, 2012 6:51 PM

Hmm… SHA1 is 80 rounds, vs 64 in SHA2.

Not sure which would be faster, considering less bits and fewer instructions used in SHA1, but I’d guess SHA1 would still be faster.

aikimark • October 5, 2012 6:51 PM

This kickstarter project aims to deliver 16-core devices for $99 and 64-core devices for $199 (45GHz and 90 gigaflops).

http://arstechnica.com/information-technology/2012/09/99-raspberry-pi-sized-supercomputer-touted-in-kickstarter-project/

Let’s have a second look at those estimates and time lines.

Ben Brockert • October 5, 2012 7:59 PM

ELI5: What does finding a collision get you? How do you make hundreds of thousands of dollars off finding one?

aikimark • October 5, 2012 9:12 PM

@Ben

Let’s say you want to perpetrate some financial fraud via a MITM attack (or just a plain old fashioned con). You might need some fake document that passes a hashing test for authenticity.

If doing high stakes fraud, you are probably looking at a $3-$5 million minimum target.

Ilya Albrekht • October 5, 2012 9:35 PM

@BJ SHA-1 considerably faster than SHA-256 (at least on Intel CPUs). 4.6 cycles/byte for SHA-1 vs. 10.3 for SHA-256.

Brian • October 6, 2012 12:09 AM

The Bitcoin comparison suggests the numbers in the original post could dramatically underestimate modern processing power and that a SHA-1 collision could definitely be within reach.

Graphics cards were the state-of-the-art in Bitcoin mining a while ago, but people are moving towards FPGA and even ASIC solutions now. And while they aren’t super widely deployed yet (particularly ASIC chips), they do provide an impressive benchmark.

A particular standalone FPGA based miner costs $600 and can do 800MHash/s of Bitcoin’s double SHA-256 operation. Ignoring hash speed differences, that’s close to 2^30 hash operations per second or 2^55 per year. 32 of those, or about $19k worth, could produce a SHA-1 collision in about a year (assuming 2^60 as a ballpark for collisions).

At the more bleeding edge of the scale, a soon to be available ASIC setup can allegedly perform 30GHash/s for about $650. That’s about 2^35 hash operations per second or the magical 2^60 per year. With the same $19k worth of those, a SHA-1 collision could be found in less than 2 weeks. A larger organization with something like $5 million to play with (so about 8,000 of those ASIC devices) could produce collisions in about 1 hour.

It’s also worth noting that everyone in the Bitcoin network is currently doing about 2^44 hash operations per second total. The network could produce a SHA-1 collision at that rate in about 18 hours.

Obviously those numbers should be taken with a big grain of salt. The collision producing procedure may not be quite a easy to optimize as Bitcoin hashing speed. And in the case of the ASICs, it remains to be seen if they hit their advertised speeds for that price point. And the ASICs and FPGAs are specifically designed for Bitcoin hashing, producing SHA-1 collisions with them directly would not be possible…a different piece of hardware would be needed.

But clearly a SHA-1 collision is within the realm of computing possibility, possibly relatively easily by a reasonably well financed attacker. The fact that anyone can do 2^60 of ANY hash operation in a year with a device that costs $650 definitely suggests SHA-1’s days are seriously numbered.

Clive Robinson • October 6, 2012 2:23 AM

@ k,

… and doesn’t the disk bandwidth affect the performance?

Yes and quite a lot more than people realise, but the usual bandwidth figures quoted in manufactures data sheets are very misleading.

This is becausee there are various steps to retrieving data off of a hard disk and these all add delays which have their own very non linear side effects.

Also just as importantly is the knowledge of how the overall system has been optimized for performance in accessing the hard drives because usually it’s optomised for sequential reads of data not random access to a couple of bytes of data.

Only when you know this information can you work out an effective method of storing the data either for a single user access or for multi simultaneous user access (and the two are vastly different).

Oh and of course the question people rarely ask when talking about large data sets for crypto cracking which is “How long will it take to populate the full data set to start with?”

Jesse Walker • October 6, 2012 7:02 AM

There are a couple of types of comments posted that require a response

The first are the ones that talk about the pricing model. My response is they are fair, and if someone wants to propose a different model then I have to say “Well done,” and the community needs to be grateful for any improvements. There are many ways my model could be wrong, so fix mistakes where you find them. What does not earn my gratitude would be an interminable discussion about whether or not to migrate without any reference to any model or any data.

The second kind of comment requiring comment is grumbling about the performance of SHA-2 or SHA-3. It is unfortunate that the SHA family is getting slower with each generation, but if we have learned anything from the competition it is that building a good hash — one that actually meets its functional requirements — is a lot harder than building a good block cipher. Continuing to use SHA-1 for the collision resistance property that it is known not to possess will eventually cause problems for implementers and for their customers — stop doing it and begin a transition as soon as possible.

Simon Zerafa • October 6, 2012 7:28 AM

Hi,

You really do need to factor in the GP/GPU element into this.

Most if not all password cracking and similar tasks are currently done with GP/GPU based setup’s which are far more capable that CPU based systems.

One example is here:

http://ob-security.info/?p=546

Kind Regards

Simon

Curious • October 6, 2012 7:58 AM

@aikimark

I remember many years ago, there was this US company named Starbridge Systems (or something similar, or maybe Star Bridge Systems if its the same company) that had this peculiar mission statement on their webpage, something about the desire to present affordable supercomputers that had size and the power requirement of a desktop.

Some time later, that mission statement was nowhere to be seen. Apparently, things didn’t work out that well. Maybe things have changed since then, or maybe not.

Konstantin Surkov • October 6, 2012 8:43 AM

0.04/hour price is for 1 core, not 32.

Brian • October 6, 2012 10:13 AM

@Jesse Walker:

I agree, the facts are definitely the facts when it comes to hash security…complaining about the decrease in speed for secure hash algorithms gets us nowhere. If we need collision resistance, complaining about how fast MD5 and SHA1 are isn’t very helpful.

I think a big part of the answer is to broaden the selection of widely supported cryptographic algorithms so people don’t have to use primitives that are slow because they provide properties the user doesn’t need. Collision resistance in a keyed MAC function, for example. Saving the slower (but secure) SHA-2/SHA-3 for when necessary could help quiet speed complaints.

Bruce suggested in another post that NIST do a fast stream cipher competition next. I’d love to see a fast MAC function competition alongside it.

John Wong • October 6, 2012 10:57 AM

I don’t understand why we can’t have multiple machines executing and finding the collision? We certainly can build a rainbow table for SHA-1 and expect it to be complete in a year with enough computers and hard disk, right?

Al Wilson • October 6, 2012 2:19 PM

@wumpus
“If you want to see what a system to efficiently compute SHA(256), you should look to bitcoin farmers. From what I understand, they used AMD5830s (now 2 generations out of date) if at all available.
According to this site:
https://en.bitcoin.it/wiki/Mining_hardware_comparison

“~$300 AMD 7950 boards pull 500 hashes/s.

“Top of the line Intel CPUs pull no more than 66 hashes/s … ”

According to your link your numbers are off by a factor of 1,000,000.

Where you say 500 hashes/s the link gives 500 Mhash/s.
I take Mhash to mean million hashes.

Roger Wolff • October 7, 2012 12:01 AM

Any increase in the number of cores per CPU, or the number of CPUs per server, also affects these calculations.
I don’t agree with this statement. You need the number of cores of current computers to calculate current computing-speed, but Moore takes the number of cores into account.

The reason normal desktop computers now have 2 or 4 cores is to keep up with Moore’s law. The amount of transistors that the process guys can put on a die goes way beyond what the processor guys can use in one processor. So after bigger caches they started putting in more cores.

wumpus • October 7, 2012 12:13 PM

“Where you say 500 hashes/s the link gives 500 Mhash/s.
I take Mhash to mean million hashes.”

The point was to give the ratio between GPU/CPU. Since they are for SHA256, absolute numbers aren’t meaningful for SHA-1.

The real catch is that pulling 500MHashes/sec spits out data at 16GB/s (or 128Gb/s). While hard drive manufacturers love to quote the speed of the interface (6Gb/s or less), a just released consumer SSD just hit 4Gb/s but poor googling implies that cost effective (spinning) hard drives tend to run under 1Gb/s.

It looks like one of those “slack until the technology catches up” problems. I don’t want to know the cost of 160*(2^80)bits of storage that can store data at 128Gb/s

Sheldon • October 8, 2012 4:14 AM

With various thoroughly tested 512-bit hashing functions being available nowadays, would it be safe using a combined hash function like this?

Überhash(input) = Skein(Sha2(input)+input) XOR Keccak(Whirlpool(input)+input)

Where + is binary concatenation, i.e. like prepending a salt.

Clive Robinson • October 8, 2012 6:44 AM

@ Sheldon,

… would it be safe using a combined hash function like this?

I won’t comment on your specific example, however as a general case chaining encryption functions together (in the right way) will increase the security margin. However it is both theoreticaly and practicaly possible to weaken a chained function which is why you should select encryption functions who’s internal functions are in effect orthagonal to each other.

If you have a look at IDEA you will see the Lai-Massey scheme that uses three broadly orthagonal methods of XOR, ADD, MUL in fields, a later cipher called FOX was based on the same idea and is now known as IDEA-NXT and claims it is proof against both Linear and Differential cryptographic attacks, as well as the difficulty of trying to make an algebraic model that could simplify the three orthagonal functions.

Paeniteo • October 8, 2012 7:25 AM

@Sheldon: “Überhash(input) = Skein(Sha2(input)+input) XOR Keccak(Whirlpool(input)+input)”

I’m not exactly sure why you put in the arbitrary separation into two halves that are then simply XORed (parallelization?). You might just want to go for:
Überhash(input) = Skein(Sha-512(Keccak(Whirlpool(input)+input)+input)+input)

k • October 8, 2012 8:13 AM

Ben wrote: “Let’s say you want to perpetrate some financial fraud via a MITM attack (or just a plain old fashioned con). You might need some fake document that passes a hashing test for authenticity.”

But that’s not a 2^60 birthday attack, where any two matches win. If you want to forge a particular existing signature, you’re back to 2^120 or so. Plus, finding any random string that collides isn’t enough. It has to look like a fake document, and the required structure severely restricts your search space. In fact, no collision may exist.

Sheldon • October 8, 2012 11:20 AM

@Paeniteo: I thought the separation would make 100% sure that any potential future weaknesses in either Skein or Keccak would have zero influence in the combined result. But I guess your way of nesting them effectively has the same property.

eerms • October 8, 2012 11:46 AM

@ wumpus

I wouldn’t be surprised that we will see SHA-1 broken more sooner. My bet is in 1 year SHA-1 will follow MD5, and by looking at the recent trends it will not be done by a criminal organization, but by a government. 2.7 Mil in “cyber-security” certainly is not an obstacle for any government.
There should be no problem achieving 128 Gb/s speeds with an not-so advanced storage cluster. A 30-40 node cluster with Infiniband and SSD’s should easily do it. Is the actual size of data to store 160*(2^80)bits? Seems too high for me.

Clive Robinson • October 8, 2012 1:53 PM

@ Bruce Schneier, Jesse Walker,

Over on ARStechnica they have coverage of Jesse’s original comments and this particular blog page,

http://arstechnica.com/security/2012/10/sha1-crypto-algorithm-could-fall-by-2018/

AnonymousHero • October 8, 2012 2:32 PM

@ eerms

Of course, one could say the same about AES. According the Bamford’s famous Wired article a number of months back, NSA has already made “huge breakthroughs” in the cryptanalysis of publicly used algorithms. Whether this means AES or public-key ciphers, he didn’t say (though he did imply it was AES).

He did say that the new Utah data center is crucial to making these breaks practical. They do this, according to him, by what sounds like a ciphertext-only attack. They compare an ungodly number of ciphertexts with each other looking for “telltale patterns.” AES is supposed to be immune to that type of attack, but apparently they have some method of making it work.

Of course, Bamford may have been mislead or the officials, not being cryptographers themselves, may have had the details wrong. In either case, I am certain AES doesn’t stop them. If it did, our whole intelligence gathering apparatus would go dark, and it obviously hasn’t.

To me, it seems simpler to crack RSA. If you do that, then AES becomes moot since most every message sent over the wire will have its AES key encrypted by RSA. So, break RSA and you have all the AES keys. It seems much more likely to me that they have found some practical method of factoring large integers into primes.

I suspect Clive could give more detail on whether such an attack as outlined in the Bamford article is practical (assuming NSA can house a yottabyte of data and utilize the fastest supercomputers).

Clive Robinson • October 8, 2012 7:32 PM

@ AnonymousHero,

So, break RSA and you have all the AES keys. It seems much more likely to me that they have found some practical method of factoring large integers into primes

The NSA don’t need to do to much on factoring large integers into primes to crack RSA.

All they realy need to do is know the random generator used to generate the two primes, and if it is one of the majority that is weak…

As was discussed on this blog a little while ago there is a fairly quick and simple test to show if two or more RSA PK certificates share either of the PQ primes. And a couple of researchers showed that indead a surprising number (ie many) PK certs visable on the internet did indead share primes. This could only be down to the use of weak random number generators.

Now if you know many certs share a prime it would be worth factoring one of the certs as this would enable you to quickly break the other certs.

But how would you factor such a cert, well you don’t have to do that much work due to the weak random number generator.

If you have analysed the generator you will know what it’s likely outputs are and thus your search space for factors is actually quite small.

This is because with many software and hardware packages the PQ pair are generated very early in the packages use when the entropy in the random number generator is actually very small.

Now from the NSA’s point of view they are in general not interested in factoring a specific cert just in factoring as many certs as possible and recovering plaintext to go into the DB.

This is due to several things but two stand head and sholders above the others,

1) To recover the plaintext of a specific message you need to break one certificate. But if you know the coresponding parties you only need to crack one of the two certs to get usable plain text,

2) Which may well due to human failings provide the text of the earlier message within it, due to the fact that with EMail and similar systems hitting “reply” includes the message you are replying to as well as your reply.

Thus although one party may have a very strong random number generator and their cert cannot be easily broken, the party they are communicating with may have a very weak random number generator with a certificate that was very easily broken. As the party with the strong certificate you probably cannot tell if the party you are communicating with has a weak and broken certificate which is alowing your messages to be read because of the “reply” function…

Whilst you can fairly easily generate strong random numbers using dice or other physical objects you tend to get only a few bits for each throw. That is two throws of a dice gives you a number in a range of 36, which for ease of use would be limited to 32 giving 5bits for two throws. These days you need around 2000bits to get a strong PK cert which would be 400 throws of a dice which would take the better part of a morning to do. The random number generator in a lightly loaded Personal Computer could take almost as long to build up the same number of bits of entropy.

In general people don’t want to wait to get good entropy when they can have little or no entropy hashed up to look like good entropy. But the reality is it’s “magic pixie dust” thinking and the likes of the NSA know this and how to exploit it.

Randall • October 8, 2012 10:37 PM

To folks asking if you can combine hash functions: yes. TLS 1.2 uses concat(MD5, SHA1). The Wikipedia article on cryptographic hashes has a section on exactly what that gets you security-wise.

But unlike with MD5 and SHA1, now there’s enough margin and study that practical breaks of SHA-512 or Keccak seem unlikely. Also, if you’re designing a new app, just try and avoid the need for collision resistance: never sign anything containing attacker-controlled data without prepending a random string to it.

As always, once you quit using the crap algorithms, bugs and design flaws quickly become more of an issue than algorithmic strength. It’s something we nerds forget that’s basically been the focus of Bruce’s work for years, and much more important than a new super-nifty new hash.

Randall • October 8, 2012 10:42 PM

Sorry, that should’ve been “TLS before v1.2” not “TLS 1.2.”

AnonymousHero • October 9, 2012 12:48 AM

@ Clive

I have heard you harp about RNG’s before (and I agree many are likely weak as the Lenstra, et al paper proved). Do you suspect this is mostly an embedded device issue (no interface devices to generate entropy) or do you think PRNG’s in OpenSSL or GnuPG are also weak?

And, yes, you make a good point about the private keys. All the attacker needs is to break one guy’s key and the attacker has immediate plaintext of every conversation that guy has had with others (even if the other keys are secure).

For instance, a couple of years ago I used to correspond on an encrypted mailing list with about 50 other people. Most everyone used GnuPG with 2048 bit keys (which should be secure). However, one guy was using 512 bit keys. I got to thinking if an attacker could factor his key (which in 2010 would be very easy and cheap to do), then all 50 of us would be instantly compromised. This is a real problem with public-key systems — you have to be sure your correspondent(s) are using long enough keys and managing their keys properly. Enforcing this gets exponentially more difficult with the more contacts one has.

One the subject of RNG’s, there was a project proposed a number of years ago by some cryptographer to put his own satellite in space that would generate huge quantities of truly random numbers and beam them to earth for instant OTP’s. A TRNG for everyone on earth. His project never went through. Of course, I don’t see any way he could avoid an oppressive government from injecting their own weakened numbers into his stream, thus compromising it. Or perhaps even jamming the signal all together.

Cornerstone • October 9, 2012 3:39 AM

I believe on Linux most programs like OpenSSL and GnuPG use /dev/random or /dev/urandom as RNG source. Is there a known home buildable device you could attach to a USB or other port that could generate better random data? I’d be interested in something like that as a hobby project just because it would be different than what others are using. Is it hard to get better random noise, thermal noise or something like that? I don’t know about math to design something like this but I am adequately skilled to make something if the source method is proven.

moo • October 9, 2012 4:14 AM

Clive’s comments about weak RNGs remind me of last month’s news of the practical attacks against Chip-and-Pin systems:
http://www.schneier.com/blog/archives/2012/09/new_attack_agai_2.html

Short version for those who didn’t read it: a lot of ATMs use crappy RNG generators when choosing the “unpredictable numbers” that are supposed to assure that the transaction is fresh. The “pre-play” attacks are possible because attackers can generate these numbers, and carry out the protocol steps that depend on them, well in advance of the actual transaction.

Clive Robinson • October 9, 2012 6:24 PM

@ AnonymousHero,

Do you suspect this is mostly an embedded device issue (no interface devices to generate entropy) or do you think PRNG’s in OpenSSL or GnuPG are also weak

Without a doubt many but by nomeans all embedded devices have RNGs with poor startup entropy, which then due to initial setup scripts make it into the device PK certs.

Likewise many software RNGs have this startup issue as well. Untill fairly recently many OS / C-lib / etc had realy poor RNG’s and many programers failed to comprehend this or if they did failed to mitigate it correctly.

Even when a good RNG is available in the program or OS there is still the “initial start up” issue where entropy starts at zero and only builds up slowly over time. And also the problem of an attacker injecting faux “known to them” entropy into the generator in various ways.

So yes even the best of software designs can be compromised but in most cases the window of oportunity for an attacker is in the past, unless they can find a way to either compromise a system (such as you mentioned with the mailing list). Or in some way force you to generate a new PK Cert etc on a computer they have managed to compromise.

Oddly the problem of lack of initial start up entropy could be easier to solve with embedded devices than with shrink wrapped software only solutions. This is because most embedded devices get factory programed with serial numbers and other “unique to device” information such as Ethernet MACs prior to being boxed and shipped. It would not be overly difficult for the manufacturer to also put in a stored entropy file that is generated on the fly on the production line.

BUT could you trust them to implement it correctly, not keep records that are accessable and do what is necessary to prevent an attacker getting at the production line system in some way. Further what do you do about the “Reset to factory defaults” option in the embeded devices main menu…

With regards putting up a satellite that generates true random data there are many issues with this that have to be solved, not least of which is how do you solve the eavesdropping issue where both Alice and Eve have the same stream of TRNG data bits.

At the end of the day you realise there are only two solutions to this,

The first is that it is each user who has to be fully responsable for ensuring that not only do they start with True Random Data but that it is also both 100% unique to them as an individual user and more importantly compleatly unknown to others in perpetuity.

Or secondly you have a hierarchical system where individuals are deemed untrustworthy by definition by an organisation that then mandates and enforces every step in the process.

The first process is always going to be subject to the “weakest link” problem of an individual not taking responsability and using short cuts or weak keys etc. The second has the advantage of preventing the weak individual issue.

But the 100% unique and in perpetuity requirments do not go away and these are difficult even in a hierarchical system. This problem along with others has given rise to the notion that True Random Data is not a usable solution at anything other than the master secrets level. For instance 100% unique cannot be done securely with using TRNG data for every user because it requires a hugh database that represents a significant target of oportunity for an attacker. The solution is to use a determanistic system such as a block cipher in CTR mode, with non determanistic sampling of it’s output. But the determanistic system needs to have a sufficiently large output size such that it can be run very rapidly for potentialy centuries and only use a very very small part of it’s output. Which adds weight to Bruce’s comment the other day that we don’t have a block cipher that is sufficiently large in the number of bits in the block size (arguably you currently need 2048bit size RSA keys, but with legal land contracts currently lasting upto 1000years…).

It is without doubt a very hard problem to solve and whatever you do, you will always have cases where it will fail for some reason unless you have full and enforcable control at every step. So is it any wonder that the NSA / GCHQ types of this world have such dictatorial KeyMat proceadures ultimately backed up by life ending sanctions (life long prison terms / execution) for treason.

Clive Robinson • October 9, 2012 7:39 PM

@ Cornerstone,

Is there a known home buildable device you could attach to a USB or other port that could generate better random data? I’d be interested in something like that as a hobby project

The first question is “What do you mean by better?”

The old definition used to be what are often described as True Random Number Generators, but these have only limited application for a whole host of reasons.

More modern thinking is based around fully determanistic algorithms with a small number of master secrets. For instance a counter that starts from a secret number, the output of this counter is then whitened (XORed with another secret) which is then encrypted with AES using a secret key. These sorts of generators are known as Cryptographicaly Secure Pseudo Random Number Generators (CS-PRNGs).

These master secrets are usually generated by a TRNG which has been very carefully designed to remove as many types of bias as the designers can think of and mitigate.

The heart of a TRNG is some kind of “physical noise source” of which many are available and few if any are “perfect” that is free of bias or other influence.

A simple noise source is thermal noise in a resistor, but it is of extreamly low level and has to be amplified. The amplifier adds it’s own noise and any bias it has as well as any external interferance caused by EM fields, sound or other mechanical or thermal energy. The amplifier is likewise not perfect it has a frequency and phase response that alter the thermal noise signal and thus cause bias.

Thus you have to find some way of partialy mitigating the effects to an acceptable level. One way to do this is to use two oscilators one fairly high frequency the other a fairly low frequency. The high frequency oscilator drives the D input of a D-Type latch the low low frequency oscilator clocks the latch and subsiquent circuits. The high frequency oscillator is usually a very low noise low Q oscilator with high “tank energy” that has logrithmic amplitude stabalisation where the frequency is stabalised by a high Q resonator such as an XTAL which is lightly coupled to the oscillator, the output from the oscilator is again very lightly coupled to the tank circuit. This oscilator uses heavily decoupled power supplies and a large amount of screening to prevent external energy from EM, mechanical or thermal sources. The design of such oscilators can be found in many RF refrences as “secondary standards” and the ARRL, RSGB et al have published such designs for use in QRP receivers and test equipment. The second low frequency oscilator is of a similar design but importantly is designed such that the oscilator is not stabalised by an external resonator but has an aditional variable reactive element loosely coupled to the tank to change it’s frequency. This variable reactive element is driven by the physical noise source. The result is the output of the high frequency oscilator is non determanisticaly sampled by the low frequency oscilator and is in effect a sub harmonic of the instantanious difference frequency between the two oscillators.

This output needs to be de-biased the start of this is to use a siple circuit that samples two bits and XORs them together (this is a von neuman de bias circuit) this along with the clock/latch signal is then Digitaly Signal Processed to remove other biases and also detect various failure modes.

Depending on the level of processing you want to do prior to the PC you can do the first stages in something like a PIC or equivalent micro controler that has an in built USB interface.

With a little thought you will realise that you can do the same thing with just the low frequency noise driven oscilator driving an interupt pin on the PIC and some kind of race circuit forground process. One way to do this is to have the forground process be a fast stream generator like ARC4, SNOW2 or any of the EU eStream finalists. The stream generator free runs continuously in the forground and gets sampled by the external interupt signal and this causes the current output byte from the stream generator to be sent over the USB port to the PC.

The important thing is to be very carefull in designing and isolating the noise source and low frequency oscilator such that external influance from power supply noise, EM “hum”, mechanical “microphonics” or temprature changes are minimised as best as posssible. Whilst also continuously monitoting it for out of specification performance that would indicate some form of failure or attack.

What ever you do, do not fall into the trap of “magical thinking” which is best explified by Intel and their idea of using a hash on the output from the physical noise source. This does not improve entropy, nor does it remove bias, and what it does do is make it almost impossible to detect most if not all types of failure of the physical noise source (rumour has it that Intel very deliberatly did this to cover up that their on chip noise source was so bad it was virtualy usless).

Roger Wolff • October 10, 2012 2:38 AM

Coming back to this article a few days later, I get the impression that this calculation is more a “best case” and “if nothing goes wrong” estimate.

It reminds me of those designing crypto themselves and thinking: “If I can’t break it, nobody can”.

None of the things an attacker may come up with were taken into account.

moore's law • October 12, 2012 12:16 AM

This whole analysis if seriously flawed.

While Moore’s law doubles the transistors every 18 months it has not result in doubling processing ability over same period.

http://en.wikipedia.org/wiki/Moore's_law#Transistor_count_versus_computing_performance

John G • October 14, 2012 7:39 AM

May I go back to Ben Brockert’s question (Oct 5, 7:59) and k’s followup (Oct 8, 8:13) about how being able to produce a collision, any collision, is going to help anyone make money by fraud? Presumably the bad guy with the massive computing power (accepting that some organized crime networks might be there when specified, or some governments) wants not just to produce an identifiable collision but to create an identifiable specific document – or part of a document, like a signature – that is not just any collision but a very specific overriding of particular data.

Is that the result of producing a first collision, or will one need years and years of computing power (even at 2022 speeds) to do it for each document, or each attempt?

Is that many real years further down the road, or is that a done deal with the first collision (or the first one whose description is published)?

Team Snowden • November 13, 2013 3:54 PM

“With the same $19k worth of those, a SHA-1 collision could be found in less than 2 weeks. A larger organization with something like $5 million to play with (so about 8,000 of those ASIC devices) could produce collisions in about 1 hour”

+1 I believe NSA’s cryptanalysis hardware budget is quite a bit more than $5 million 😉

I’m not sure what the point is in calculating a COTS threat-model when the most capable adversary is almost certainly using custom-built equipment. There are, after all, just a handful of hash algorithms being used for the vast majority of hash-related security functions. As the leaks have shown, the vulnerabilities introduced by the most capable adversary creates opportunities for exploits by less capable adversaries following in their footsteps. Haven’t we learned from the Snowden leaks that we can no longer afford to design security protocols to anything less than a standard of frustrating the most capable adversary on the most pessimistic – yet reasonable – estimates of its capabilities?

Mike Anthis • November 13, 2013 11:29 PM

How much entropy is available over NTP? Maybe enough to seed hardware RNGs? If a hardware reset can’t reset the seed, doesn’t that go a long way?

Nick Borgers • June 24, 2014 10:39 AM

Leaving aside when collision will occur, is collision, with anywhere near this difficulty, even a problem?

My reading indicates that the file found to collide was found only by that characteristic, not by its readability or format. Even if the ability to find such an arbitrary file with the same hash exists, does it actually affect your ability to verify data?

Would finding a useful (as in capable of misleading people or systems) file face a problem akin to “1000 monkeys on 1000 typewriters producing Shakespeare”?

Peter Green • October 24, 2014 12:51 PM

AIUI the design of the MD5/SHA1/SHA2 family of hashes and of current collision finding techniques means that it is no more difficult to find a collision with a common chosen prefix and suffix than it is to find a basic collision. Combining this with a rich data format where a conditional can be based on the content of the “random garbage” block generated by the collision generator allows you to create two documents with the same hash but very different user-visible content fairly easilly (it may take a few attempts to get a pair of “random garbage” blocks that trigger opposite branches of the conditional). This may allow certain kinds of fraud (for example getting someone to digitally sign one contract and then claiming they signed a different one).

Generating a collision with distinct chosen prefixes is more powerful as it gives you a realistic chance of creating a useful collision in situations when you aren’t using a data format that allows conditional logic and/or situations where you control part of the contents of the file to be hashed and can predict the rest. Such an attack on MD5 was successfully used to mount an attack demonstration on a SSL CA. For MD5 the distinct chosen prefix attack was harder than the basic attack but afaict it was only a few years between the former and the latter.

Duh • March 12, 2015 8:42 PM

The rapture is due in 2017. So I am ok with SHA-1.

Osama • March 15, 2015 8:40 AM

I was working on research projects. How can I get promoted to Organized Crime Syndicate level?

Xerxes Rånby • October 8, 2015 7:42 AM

https://sites.google.com/site/itstheshappening/ – First know SHA-1 collision
“we estimate the SHA-1 collision cost today (i.e., Fall 2015) between 75K$ and 120K$ renting Amazon EC2 cloud computing over a few months.”

NATASHA • December 16, 2015 10:27 PM

Hi, My name natasha and i just want to share my experience with everyone. I have being hearing about this blank ATM card for a while and i never really paid any interest to it because of my doubts. Until one day i discovered a hacking guy called Edwin. he is really good at what he is doing. Back to the point, I inquired about The Blank ATM Card. If it works or even Exist. They told me Yes and that its a card programmed for random money withdraws without being noticed and can also be used for free online purchases of any kind. This was shocking and i still had my doubts. Then i gave it a try and asked for the card and agreed to their terms and conditions. Hoping and praying it was not a scam. One week later i received my card and tried with the closest ATM machine close to me, It worked like magic. I was able to withdraw up to $3000. This was unbelievable and the happiest day of my life. So far i have being able to withdraw up to $28000 without any stress of being caught. I don’t know why i am posting this here, i just felt this might help those of us in need of financial stability. blank Atm has really change my life. If you want to contact them, Here is the email address benson.blankatmcard@gmail.com . And I believe they will also Change your Life.

Luc • September 2, 2016 6:14 AM

I don’t quite get it. Where does the 2^60 come from? SHA1 is 160 bits, not 60.
Brute forcing 2^159 attempts (the average it would take to break 160 bits) with current hardware would take longer than the universe’s lifetime from the big bang through today.

P. Cruiser • February 23, 2017 9:42 AM

Google just announced the first SHA-1 collision. RIP SHA-1.

https://security.googleblog.com/2017/02/announcing-first-sha1-collision.html

ziggy • February 23, 2017 6:52 PM

@Luc see the reference to the Stevens’ Attack. http://2012.sharcs.org/slides/stevens.pdf

Marc Stevens (CWI Amsterdam), is the leader of the collaboration between CWI and Google that announced their ability to create SHA-1 Collisions with files of the same length.
https://security.googleblog.com/2017/02/announcing-first-sha1-collision.html?m=1

This is bad for Application Whitelisting Systems that rely upon a SHA-1 or even a SHA-1 and File Length to verify an executable file’s identify.

ziggy • February 23, 2017 6:57 PM

Thanks Bruce for your calculation back in 2012. This lead us to build an App Trust-Listing System that relies upon five hashes and the file’s length as a cyber-metric handprint to prevent hash collision spoofing like the one just announced by Google today.

I’d love to see your analysis of the possibility of spoofing a file based on the file’s length and its five hashes, SHA-1, SHA-256, SHA-512, MD5 and CRC32.

Thanks in advance…zig

Clive Robinson • February 24, 2017 3:57 AM

@ ziggy,

CRC32 is not realy a security hash for a couple of reasons.

Firstly it’s designed to catch random data errors very efficiently thus is quite linear and very low in complexity in it’s method of calculation.

Secondly, it’s output is only 32bits in length, the usuall “birthday paradox” rule of thumb says you would have around a 50/50 chance of a collision in around half the bit length –square root– attempts or 16 bits or around 65K tries.

More formaly the probability of collision between n texts in H space is typically written as:

1 – e^-(0.5 * n(n-1) * H)

Where n is the number of evenly distributed randomly selected hashes to compare and H is the size of the element count of all possible hashes. As can be seen the “0.5 * n(n-1)” is broadly the same as n*n or n^2 which as a e^-(2), is approx “half the bit length for larger values.

This means that with ~10K hashes you’ve about a 1% chance of a collision, and you stand a 50% chance of collision at ~80K hashes. Which is so small that the security value is close to zero.

Elizabeth Mark • January 2, 2021 3:38 AM

GET RICH WITH BLANK ATM CARD … Whatsapp: +18033921735

I want to testify about Dark Web blank atm cards which can withdraw money from any atm machines around the world. I was very poor before and have no job. I saw so many testimony about how Dark Web hackers send them the atm blank card and use it to collect money in any atm machine and become rich. ( darkwebblankatmcard@gmail.com ) I email them also and they sent me the blank atm card. I have use it to get 90,000 dollars. withdraw the maximum of 5,000 USD daily. Dark Web is giving out the card just to help the poor. Hack and take money directly from any atm machine vault with the use of atm programmed card which runs in automatic mode.

Email: darkwebblankatmcard@gmail.com
Text & Call or WhatsApp: +18033921735

BLANK ATM CARD • March 27, 2021 1:49 AM

GET RICH WITH BLANK ATM CARD … Whatsapp: +18033921735

Email: darkwebblankatmcard@gmail.com
Text & Call or WhatsApp: +18033921735

joel alvarez • July 20, 2021 7:02 PM

Hello, are you guys ready to make real cash??? No dull moments anymore. No more depending on cheap checks every week. Get thousands of dollars or any currency of your choice and make this life worth living for. Order for a blank ATM card now. How does it work? Our cards are loaded with a balance of $5000 to $100,000.00 with different daily withdrawal limits depending on the card you are buying and you can use the blank atm card to shop online and withdraw cash from any ATM machine closer to you.★ Is this real? Yes, as shown in the video we withdrew cash multiple times without any issues. You can do it too.★ Can I be traced? No, your withdrawal/transactions are completely anonymous.★ Can I trust this method? Yes, we have not had any issue when doing this for the past 5 years now.★ Are people using this ATM card? Absolutely, alot of people {our trusted customers) have quit their jobs to withdraw money on a daily basis. ★ How do I get my card? We will ship your Blank Card /wa pin hours after receiving cleared payment through a courier service International and give you the tracking details of your card, 2-4 business day delivery service. Once you receive the card you can start cashing out. ★Is this real? YES: we are 100% real and have been doing this since 2015 Contact us to order a working blanK ATM Card that you can use to withdraw a minimum amount of $1000 and maximum amount of $10,000 daily withdrawal limit. Online maximum purchase limit is $30,000. They also trade on Bitcoin Contact via email: mrmichealblankatmcard@gmail.com or WhatsApp/call: ‪+1 (631) 310‑4959‬

Schneier on Security

When Will We See Collisions for SHA-1?

Comments

Leave a comment Cancel reply