zfs/sys at a073aeb0604b7d410be58135fb9d5c43671af263 - zfs

History

Brian Behlendorf a073aeb060 Add KMC_SLAB cache type For small objects the Linux slab allocator has several advantages over its counterpart in the SPL. These include: 1) It is more memory-efficient and packs objects more tightly. 2) It is continually tuned to maximize performance. Therefore it makes sense to layer the SPLs slab allocator on top of the Linux slab allocator. This allows us to leverage the advantages above while preserving the Illumos semantics we depend on. However, there are some things we need to be careful of: 1) The Linux slab allocator was never designed to work well with large objects. Because the SPL slab must still handle this use case a cut off limit was added to transition from Linux slab backed objects to kmem or vmem backed slabs. spl_kmem_cache_slab_limit - Objects less than or equal to this size in bytes will be backed by the Linux slab. By default this value is zero which disables the Linux slab functionality. Reasonable values for this cut off limit are in the range of 4096-16386 bytes. spl_kmem_cache_kmem_limit - Objects less than or equal to this size in bytes will be backed by a kmem slab. Objects over this size will be vmem backed instead. This value defaults to 1/8 a page, or 512 bytes on an x86_64 architecture. 2) Be aware that using the Linux slab may inadvertently introduce new deadlocks. Care has been taken previously to ensure that all allocations which occur in the write path use GFP_NOIO. However, there may be internal allocations performed in the Linux slab which do not honor these flags. If this is the case a deadlock may occur. The path forward is definitely to start relying on the Linux slab. But for that to happen we need to start building confidence that there aren't any unexpected surprises lurking for us. And ideally need to move completely away from using the SPLs slab for large memory allocations. This patch is a first step. NOTES: 1) The KMC_NOMAGAZINE flag was leveraged to support the Linux slab backed caches but it is not supported for kmem/vmem backed caches. 2) Regardless of the spl_kmem_cache_*_limit settings a cache may be explicitly set to a given type by passed the KMC_KMEM, KMC_VMEM, or KMC_SLAB flags during cache creation. 3) The constructors, destructors, and reclaim callbacks are all functional and will be called regardless of the cache type. 4) KMC_SLAB caches will not appear in /proc/spl/kmem/slab due to the issues involved in presenting correct object accounting. Instead they will appear in /proc/slabinfo under the same names. 5) Several kmem SPLAT tests needed to be fixed because they relied incorrectly on internal kmem slab accounting. With the updated test cases all the SPLAT tests pass as expected. 6) An autoconf test was added to ensure that the __GFP_COMP flag was correctly added to the default flags used when allocating a slab. This is required to ensure all pages in higher order slabs are properly refcounted, see `ae16ed9`. 7) When using the SLUB allocator there is no need to attempt to set the __GFP_COMP flag. This has been the default behavior for the SLUB since Linux 2.6.25. 8) When using the SLUB it may be desirable to set the slub_nomerge kernel parameter to prevent caches from being merged. Original-patch-by: DHE <git@dehacked.net> Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Prakash Surya <surya1@llnl.gov> Signed-off-by: Tim Chase <tim@chase2k.com> Signed-off-by: DHE <git@dehacked.net> Signed-off-by: Chunwei Chen <tuxoko@gmail.com> Closes #356		2014-05-22 10:28:01 -07:00
..
fm	Change spl-kmod-devel install path	2013-03-14 12:01:05 -07:00
fs	Change spl-kmod-devel install path	2013-03-14 12:01:05 -07:00
sysevent	Change spl-kmod-devel install path	2013-03-14 12:01:05 -07:00
Makefile.am	Emulate illumos interface cv_timedwait_hires()	2013-11-04 09:49:24 -08:00
acl.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
acl_impl.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
atomic.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
attr.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
bitmap.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
bootconf.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
bootprops.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
buf.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
byteorder.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
callb.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
callo.h	Emulate illumos interface cv_timedwait_hires()	2013-11-04 09:49:24 -08:00
cmn_err.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
compress.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
condvar.h	Emulate illumos interface cv_timedwait_hires()	2013-11-04 09:49:24 -08:00
conf.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
console.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
cpupart.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
cpuvar.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
crc32.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
cred.h	Linux 3.8 compat: Use kuid_t/kgid_t when required	2013-08-09 10:09:29 -07:00
ctype.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
ddi.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
debug.h	This patch add a CTASSERT macro for compile time assertion.	2014-04-14 09:28:53 -07:00
dirent.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
disp.h	Add kpreempt() compatibility macro	2013-10-09 13:52:55 -07:00
dkio.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
dklabel.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
dnlc.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
dumphdr.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
efi_partition.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
errno.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
extdirent.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
fcntl.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
file.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
idmap.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
int_limits.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
int_types.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
inttypes.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
isa_defs.h	Add support for aarch64 (ARMv8)	2014-04-25 15:25:32 -07:00
kidmap.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
kmem.h	Add KMC_SLAB cache type	2014-05-22 10:28:01 -07:00
kobj.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
kstat.h	3537 add kstat_waitq_enter and friends	2013-10-25 13:41:52 -07:00
list.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
mkdev.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
mntent.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
modctl.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
mode.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
mount.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
mutex.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
note.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
open.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
param.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
pathname.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
policy.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
pool.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
priv_impl.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
proc.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
processor.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
pset.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
random.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
refstr.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
resource.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
rwlock.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sdt.h	Define SET_ERROR()	2013-10-09 14:20:46 -07:00
sid.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
signal.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
stat.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
stropts.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sunddi.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sunldi.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sysdc.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sysevent.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
sysmacros.h	Linux 3.15 compat: NICE_TO_PRIO and PRIO_TO_NICE	2014-05-07 13:38:03 -07:00
systeminfo.h	Simplify hostid logic	2014-04-14 09:04:41 -07:00
systm.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
t_lock.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
taskq.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
thread.h	De-inline spl_kthread_create().	2014-04-09 19:17:12 -07:00
time.h	Emulate illumos interface cv_timedwait_hires()	2013-11-04 09:49:24 -08:00
timer.h	Add ddi_time_after and friends	2014-04-14 09:32:01 -07:00
tsd.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
types.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
types32.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
u8_textprep.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
uio.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
unistd.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
utsname.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
va_list.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
varargs.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
vfs.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
vfs_opreg.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
vmsystm.h	Include linux/vmalloc.h for ARM and Sparc	2014-01-07 10:45:39 -08:00
vnode.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
zmod.h	Refresh links to web site	2013-03-04 19:09:34 -08:00
zone.h	Refresh links to web site	2013-03-04 19:09:34 -08:00