OpenZFS on Linux and FreeBSD

Go to file

Alexander Motin 891568c990 Split dmu_zfetch() speculation and execution parts To make better predictions on parallel workloads dmu_zfetch() should be called as early as possible to reduce possible request reordering. In particular, it should be called before dmu_buf_hold_array_by_dnode() calls dbuf_hold(), which may sleep waiting for indirect blocks, waking up multiple threads same time on completion, that can significantly reorder the requests, making the stream look like random. But we should not issue prefetch requests before the on-demand ones, since they may get to the disks first despite the I/O scheduler, increasing on-demand request latency. This patch splits dmu_zfetch() into two functions: dmu_zfetch_prepare() and dmu_zfetch_run(). The first can be executed as early as needed. It only updates statistics and makes predictions without issuing any I/Os. The I/O issuance is handled by dmu_zfetch_run(), which can be called later when all on-demand I/Os are already issued. It even tracks the activity of other concurrent threads, issuing the prefetch only when _all_ on-demand requests are issued. For many years it was a big problem for storage servers, handling deeper request queues from their clients, having to either serialize consequential reads to make ZFS prefetcher usable, or execute the incoming requests as-is and get almost no prefetch from ZFS, relying only on deep enough prefetch by the clients. Benefits of those ways varied, but neither was perfect. With this patch deeper queue sequential read benchmarks with CrystalDiskMark from Windows via iSCSI to FreeBSD target show me much better throughput with almost 100% prefetcher hit rate, comparing to almost zero before. While there, I also removed per-stream zs_lock as useless, completely covered by parent zf_lock. Also I reused zs_blocks refcount to track zf_stream linkage of the stream, since I believe previous zs_fetch == NULL check in dmu_zfetch_stream_done() was racy. Delete prefetch streams when they reach ends of files. It saves up to 1KB of RAM per file, plus reduces searches through the stream list. Block data prefetch (speculation and indirect block prefetch is still done since they are cheaper) if all dbufs of the stream are already in DMU cache. First cache miss immediately fires all the prefetch that would be done for the stream by that time. It saves some CPU time if same files within DMU cache capacity are read over and over. Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Reviewed-by: Adam Moss <c@yotes.com> Reviewed-by: Matthew Ahrens <mahrens@delphix.com> Signed-off-by: Alexander Motin <mav@FreeBSD.org> Sponsored-By: iXsystems, Inc. Closes #11652		2021-03-19 22:56:11 -07:00
.github	CI checkstyle: pin ubuntu version	2021-03-11 17:11:31 -08:00
cmd	Fix zfs_get_data access to files with wrong generation	2021-03-19 22:53:31 -07:00
config	Linux 5.12 update: bio_max_segs() replaces BIO_MAX_PAGES	2021-03-19 22:33:42 -07:00
contrib	dracut: Fix race condition between load-key and import	2021-01-26 12:14:22 -08:00
etc	zfs-import-{cache,scan}: change condition to FileNotEmpty	2021-02-05 11:25:22 -08:00
include	Split dmu_zfetch() speculation and execution parts	2021-03-19 22:56:11 -07:00
lib	zpool import cachefile improvements	2021-03-12 15:42:27 -08:00
man	Fix typo in zgenhostid.8	2021-03-19 22:39:42 -07:00
module	Split dmu_zfetch() speculation and execution parts	2021-03-19 22:56:11 -07:00
rpm	Add "compatibility" property for zpool feature sets	2021-02-17 21:30:45 -08:00
scripts	Fix error message when zfs module are already unloaded	2021-02-20 20:23:10 -08:00
tests	Fix regression in POSIX mode behavior	2021-03-19 22:50:46 -07:00
udev	Centralize variable substitution	2020-07-14 17:33:44 -07:00
.editorconfig	Add an .editorconfig; document git whitespace settings	2020-01-27 13:32:52 -08:00
.gitignore	Add FreeBSD support to OpenZFS	2020-04-14 11:36:28 -07:00
.gitmodules	Add zimport.sh compatibility test script	2014-02-21 12:10:31 -08:00
AUTHORS	Add zstd support to zfs	2020-08-20 10:30:06 -07:00
CODE_OF_CONDUCT.md	Replace ZFS on Linux references with OpenZFS	2020-10-08 20:10:13 -07:00
COPYRIGHT	Fix typos	2020-06-09 21:24:09 -07:00
LICENSE	Update build system and packaging	2018-05-29 16:00:33 -07:00
META	Linux 5.11 compat: META	2021-02-10 10:11:21 -08:00
Makefile.am	cppcheck: integrete cppcheck	2021-01-26 16:12:26 -08:00
NEWS	Fix NEWS file	2020-08-26 21:44:41 -07:00
NOTICE	Update build system and packaging	2018-05-29 16:00:33 -07:00
README.md	Update FreeBSD versions	2021-03-16 15:03:28 -07:00
TEST	Remove CI builder customization from TEST	2020-03-16 10:46:03 -07:00
autogen.sh	Cause autogen.sh to fail if autoreconf fails	2018-07-06 09:27:37 -07:00
configure.ac	ZTS: Add tests for DOS mode attributes	2021-03-16 15:00:14 -07:00
copy-builtin	Replace ZFS on Linux references with OpenZFS	2020-10-08 20:10:13 -07:00
zfs.release.in	Move zfs.release generation to configure step	2012-07-12 12:22:51 -07:00

README.md

OpenZFS is an advanced file system and volume manager which was originally developed for Solaris and is now maintained by the OpenZFS community. This repository contains the code for running OpenZFS on Linux and FreeBSD.

Official Resources

Documentation - for using and developing this repo
ZoL Site - Linux release info & links
Mailing lists
OpenZFS site - for conference videos and info on other platforms (illumos, OSX, Windows, etc)

Installation

Full documentation for installing OpenZFS on your favorite operating system can be found at the Getting Started Page.

Contribute & Develop

We have a separate document with contribution guidelines.

We have a Code of Conduct.

Release

OpenZFS is released under a CDDL license. For more details see the NOTICE, LICENSE and COPYRIGHT files; UCRL-CODE-235197

Supported Kernels

The META file contains the officially recognized supported Linux kernel versions.
Supported FreeBSD versions are any supported branches and releases starting from 12.2-RELEASE.