I just had a brief conversation about how reads work under #ZFS, how flash-based SSDs actually perform, and why that means cache devices are only rarely useful. Thought I would elaborate a bit and share more broadly.
First, a quick description of ARC, the Adaptive Replacement Cache. It’s the ZFS read caching layer. It involves an index and two data pools. One pool stores most recently used data. The other pool stores most frequently used data, even if it wasn’t used recently. Any time data is read, the system checks the ARC index to see if the data is cached. If it is, the copy in RAM is returned. Otherwise, the storage is read. For most workloads, this combination works *really* well. The data you request is almost always in RAM, and only a small percentage of requests need to hit the much slower storage.
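If you want to see the shape of that lookup in code, here’s a toy Python sketch. It’s deliberately simplified and is not ZFS’s actual implementation: real ARC also keeps ghost lists of recently evicted entries and uses them to adaptively resize the two pools, none of which this version attempts. The names (`TinyArc`, `backing_store`, and so on) are made up for the illustration.

```python
from collections import OrderedDict

class TinyArc:
    """Toy two-pool read cache, loosely modeled on ARC's MRU/MFU split.

    Real ARC also tracks ghost lists of recently evicted entries and
    rebalances the two pools adaptively; that's omitted here.
    """

    def __init__(self, capacity, backing_store):
        self.capacity = capacity      # total cached records across both pools
        self.backing = backing_store  # stands in for the (slow) pool vdevs
        self.mru = OrderedDict()      # seen once: most recently used
        self.mfu = OrderedDict()      # seen more than once: most frequently used

    def read(self, key):
        if key in self.mfu:           # frequent hit: refresh its position
            self.mfu.move_to_end(key)
            return self.mfu[key]
        if key in self.mru:           # second hit: promote MRU -> MFU
            value = self.mru.pop(key)
            self.mfu[key] = value
            return value
        value = self.backing[key]     # miss: go to the slow storage
        self.mru[key] = value
        self._evict()
        return value

    def _evict(self):
        while len(self.mru) + len(self.mfu) > self.capacity:
            # Evict oldest MRU entries first; fall back to oldest MFU.
            victim = self.mru if self.mru else self.mfu
            victim.popitem(last=False)


store = {f"block{i}": f"data{i}" for i in range(1000)}
cache = TinyArc(capacity=100, backing_store=store)
cache.read("block7")   # miss: read from "storage", lands in MRU
cache.read("block7")   # hit: promoted to MFU, served from RAM
```

The point is just the order of operations: check the index, serve from RAM if you can, and only fall back to storage on a miss.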
ZFS cache devices are often called L2ARC. They don’t really use ARC logic, though. Instead, cache devices are populated with data that is about to fall out of ARC. This is a constant trickle of data (usually about 10 MB/s) being written to the cache device. An index of what data is on the cache device is also stored in RAM. The amount of RAM spent on the index depends on the record size (100 GB of small records costs more to index than 100 GB of big records). Now, when data is requested, the system checks the ARC index, then the cache index, then the storage. If the cache index says the cache device has the data, it’s read from there rather than from the storage.
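To put rough numbers on the record-size point, here’s a back-of-the-envelope sketch. The 70-bytes-per-record header size is an assumption I’m using as a stand-in (the real per-record cost varies by OpenZFS version); what matters is the shape of the result, not the exact figure: the same 100 GB of cache costs far more RAM to index when the records are small.

```python
# Rough RAM cost of indexing a cache device, per the record-size point above.
# HEADER_BYTES is an assumed stand-in for the in-RAM header kept per cached
# record; the exact figure depends on the OpenZFS version.
HEADER_BYTES = 70

def index_ram_bytes(cache_bytes, record_bytes):
    """RAM needed to index cache_bytes of data stored as record_bytes records."""
    return (cache_bytes // record_bytes) * HEADER_BYTES

cache_size = 100 * 2**30                       # 100 GB cache device
for record in (8 * 2**10, 128 * 2**10):        # 8K vs 128K records
    print(f"{record // 2**10:>4}K records: "
          f"{index_ram_bytes(cache_size, record) / 2**20:.0f} MB of RAM for the index")
# 8K records -> ~875 MB of index; 128K records -> ~55 MB (with the assumed header size)
```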
Everybody who has dealt with spinning disks knows they aren’t very good at reading and writing at the same time. The heads have to seek all over the disk. Unfortunately, SSDs have a very similar problem. Give them an all-read or all-write workload and they’re really fast, but mix in even 10% of the other and performance craters. Tom’s Hardware did an excellent test of an Optane drive a few years ago (https://www.tomshardware.com/reviews/intel-optane-ssd-900p-3d-xpoint,5292-2.html), which demonstrated this problem. I’ve attached one of the more interesting performance graphs.
Note that at 90% read workload, performance of the non-Optane drives drops to less than half of what it is at 100% read. ZFS cache vdevs have a constant low-level write workload. They’ll never be above 90% read workload.
Meanwhile, the devices used to provide capacity to a ZFS pool only receive write workload when a transaction group closes. By default, this is every five seconds, but it can be less if the transaction group fills early. The rest of the time, there’s no writing. This leaves them free for 100% read workload for seconds at a time, even under heavy write load.
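Here’s a toy timeline contrasting the two write patterns. The five-second interval is the default transaction-group timeout mentioned above; the one-second commit duration and the perfectly steady cache trickle are made-up simplifications for the illustration, not measurements.

```python
# Toy timeline contrasting write patterns, one line per second.
# TXG_INTERVAL matches the default transaction-group timeout; COMMIT_TIME
# and the constant cache trickle are illustrative assumptions.
TXG_INTERVAL = 5   # seconds between transaction-group commits
COMMIT_TIME = 1    # assumed seconds of writing per commit

for second in range(15):
    capacity = "WRITE burst" if second % TXG_INTERVAL < COMMIT_TIME else "reads only"
    cache = "read + trickle write"   # cache vdev writes continuously
    print(f"t={second:2}s  capacity vdev: {capacity:11}  cache vdev: {cache}")
```

The capacity vdevs get multi-second stretches of pure reads between commits; the cache vdev never does.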
The result is that, to provide any benefit, a cache device has to be significantly faster than the pool devices. With SATA SSDs providing capacity, even an NVMe SSD is probably too slow to be a good cache device.