Would You Play Russian Roulette with a UUID?

Aug 09, 2022 @ 10:52 pm

Random Numbers I was explaining UUIDs to a junior colleague today, who was hung up on the idea of probabilities, and it lead to an interesting viewpoint.

A UUID is a Universal Unique IDentifier. They’re used all over the place in IT: as database identifiers, unique file names, etc. Here’s one:

67fbd46e-7609-477f-b5c3-edf98bbbb511

I generated that on Linux by typing

uuid -v4

Of course, there are many libraries for all the major programming languages as well.

There are different types of UUIDs, and some depend on the issuer (i.e., you) setting a namespace, which then puts you in charge of ensuring that everything in your namespace is unique. Here I’m talking about v4, which uses for purely random generation.

When I issued the above command, my computer didn’t go talk to some global registry that keeps a list of all the UUIDs issued. Instead, the uuid utility went through a series of random number generations to produce the above. So if my computer generates 67fbd46e-7609-477f-b5c3-edf98bbbb511, what’s to prevent your computer from generating the same number and having a collision?

Nothing. Except math. Usually. And then there’s that revolver gamble.

The odds of two randomly-generating UUIDs colliding (having the same value) is 5.3×10³⁶. For comparison, there are about 6×10²³ silicon atoms in the universe, so you could give every silicon atom its own UUID and still have plenty leftover (the precise calculation is left as an exercise for the reader).

However, you may recall when I said my computer wasn’t checking out some guaranteed unique number from a registry, but generating it randomly itself? Do you see a problem there?

My Linux box is using whatever standard library the uuid utility uses, and I’m sure it’s engineered to be the best possible. But what if you’ve got a laptop running CrapOS and CrapOS has a horrible random number generator? As RFC 4122 puts it:

Distributed applications generating UUIDs at a variety of hosts must
be willing to rely on the random number source at all hosts.

In practice, this is more of a theoretical rather than an actual risk, because everyone uses standard libraries and no one has a motive to do something stupid.

But back to my colleague. Her concern was that there could still be a collision someday. My opinion was that at a certain level of odds, you just assume it’s not going to happen. However, my associate noted that there is always an implicit risk/reward calculation.

If there is a UUID collision at some point, what really is the damage? A database error or some web app can’t process a POST. Since the collision is probably not going to be a serious problem, the risk is acceptable.

But what if someone made you the proposition that you can play a game of Russian roulette and if you survive, you will receive $1 billion. Would you play? Most people would say no, because even though you have an 83% chance of winning, the risk is your life.

What if it was a 20-chamber gun? Even with a 95% chance of a wonderful outcome, you wouldn’t play. A million-chamber game? No.

But one in a UUID? Probably not. But didn’t I just say we “assume it’s not going to happen”? It’s not going to happen. You’re perfectly safe. But there’s a chance…

How about you? Would you pull the trigger in Russian roulette if the there were 5.3×10³⁶ chambers and payoff was $1 billion? Let us know in the comments below.

raindog308

Raindog308 is a longtime LowEndTalk community administrator, technical writer, and self-described techno polymath. With deep roots in the *nix world, he has a passion for systems both modern and vintage, ranging from Unix, Perl, Python, and Golang to shell scripting and mainframe-era operating systems like MVS. He’s equally comfortable with relational database systems, having spent years working with Oracle, PostgreSQL, and MySQL.

As an avid user of LowEndBox providers, Raindog runs an empire of LEBs, from tiny boxes for VPNs, to mid-sized instances for application hosting, and heavyweight servers for data storage and complex databases. He brings both technical rigor and real-world experience to every piece he writes.

Beyond the command line, Raindog is a lover of German Shepherds, high-quality knives, target shooting, theology, tabletop RPGs, and hiking in deep, quiet forests.

His goal with every article is to help users, from beginners to seasoned sysadmins, get more value, performance, and enjoyment out of their infrastructure.

You can find him daily in the forums at LowEndTalk under the handle @raindog308.

Would You Play Russian Roulette with a UUID?

No Comments

Leave a Reply Cancel reply

About LowEndBox

Recent Posts

Popular Posts on LowEndTalk

HostDare VPS Offers - Double RAM & Bandwidth ! USA/JAPAN/BULGARIA !

Looking for VPS with 50+ ipv4 ips

Flash Sale - VPS & Dedicated - India/SG/JP/USA/CA/UK/NL/DE - 25TB BW, 10Gbit From $12

racknerd backup

Virtual Private Server Hosting FAQ

Get notified of new offers