This change should wrap up the last of the missing block device
support in the vdev_disk layer. With this change I can now
successfully create and use zpools which are layered on top of
md and lvm virtual devices. The following changes include:
1) The big one: properly handle the case where a page cannot be added
to a bio due to a dynamic limitation imposed by a merge_bvec_fn
handler. For example, the md driver will limit a bio to the
configured stripe size. Our bio size may also end up being limited
by the maximum request size and other factors determined during bio
construction.
To handle all of the above cases the code has been updated to
handle failures from bio_add_page(), which had been hardcoded to
never fail in the prototype proof-of-concept implementation. In
the case of a failure, the number of bytes which still need to be
added to a bio is returned. New bios are allocated and attached
to the dio until the entire data buffer is mapped to bios. It is
then submitted as before to the request queue, and once all the bios
attached to a dio have finished, the completion callback is run.
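The retry pattern described above looks roughly like the following
sketch against the 2.6.x block API. The dio_bio_attach() helper and
the dr/buf/size naming are hypothetical illustrations, not the actual
code:

```c
/* Sketch only: keep allocating bios until bio_add_page() has
 * accepted every byte of the data buffer. */
for (offset = 0; offset < size; ) {
        struct bio *bio = bio_alloc(GFP_NOIO, nr_pages);

        dio_bio_attach(dr, bio);        /* hypothetical: link bio to dio */

        while (offset < size) {
                struct page *pg = virt_to_page(buf + offset);
                unsigned int len = MIN(size - offset,
                    PAGE_SIZE - offset_in_page(buf + offset));

                /* bio_add_page() may accept fewer bytes than requested
                 * (merge_bvec_fn, max request size, ...); it returns the
                 * number of bytes actually added, 0 when the bio is full. */
                len = bio_add_page(bio, pg, len, offset_in_page(buf + offset));
                if (len == 0)
                        break;          /* bio full, allocate another */

                offset += len;
        }
}
```

Once the loop exits every byte of the buffer is covered by some bio
attached to the dio, and the whole set can be submitted.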
2) The devid comments have been removed because it is not clear to
me that we will need devid support. They have been replaced
with a comment explaining that udev can and should be used instead.
The hard-coded 512-byte SECTOR_SIZE has been removed and replaced
with bdev_hardsect_size() to get the correct hardware sector size.
Usage of get_capacity() was incorrect. When the block_device
references a partition we need to return bdev->bd_part->nr_sects.
If get_capacity() is used the entire device size will be returned,
ignoring partition information. That is, however, the correct thing
to do when the block device in question has no partition table.
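Along the lines of the following sketch (not the verbatim code):

```c
/* Sketch: device capacity in 512-byte sectors, honoring partitions. */
static uint64_t
bdev_capacity(struct block_device *bdev)
{
        /* When bdev references a partition, get_capacity() would
         * return the size of the whole disk, so use the partition's
         * own sector count instead. */
        if (bdev->bd_part != NULL)
                return (bdev->bd_part->nr_sects);

        /* No partition table: the whole-device size is correct. */
        return (get_capacity(bdev->bd_disk));
}
```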
Exposed by the FC11 debug kernel: we need to hold a reference over
all calls to submit_bio(). Otherwise it is possible for all the
completion callbacks to run before we exit __vdev_disk_physio(), and
we end up with a GPF. This was quickly exposed when slab poisoning
was enabled. I have added helper functions to cleanly track the
reference counts. In addition, dr->dr_ref was converted from an
integer to an atomic type, which removes the need for the spinlock.
As a nice side effect of these changes the code is now slightly
cleaner and clearer.
There is concern that READA may do more than simply reorder the queue;
there may be an increased chance that a request marked READA will
fail because the elevator considers it optional. For this reason, all
read requests, even speculative ones, have been converted back to READ.
Tested under CHAOS4.2, RHEL5, SLES11, and FC11 (all x86_64)
Features:
Honor spa_mode() when opening the block device. Previously this
was ignored and devices were always opened read/write.
Integrated the DKIOCFLUSHWRITECACHE zio operation with the Linux
WRITE_BARRIER for kernels post 2.6.24, where empty bio requests are
supported. For earlier kernels ENOTSUP is returned and no barriers
are performed. If RHEL5-based kernels are intended to be supported
long term we may need to make use of the old awkward API.
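The flush path then looks roughly like this sketch; the
HAVE_BIO_EMPTY_BARRIER guard and the completion callback name are
hypothetical:

```c
#ifdef HAVE_BIO_EMPTY_BARRIER           /* hypothetical configure check */
        /* Post-2.6.24: an empty bio submitted with WRITE_BARRIER
         * flushes the write cache without transferring any data. */
        bio = bio_alloc(GFP_NOIO, 0);
        bio->bi_end_io = vdev_disk_io_flush_completion;  /* hypothetical */
        bio->bi_bdev = bdev;
        submit_bio(WRITE_BARRIER, bio);
        return (0);
#else
        /* Pre-2.6.24 kernels: no empty barrier support. */
        return (ENOTSUP);
#endif
```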
With the addition of WRITE_BARRIER support, all writes which were
WRITE_SYNC can now safely be made WRITE bios. They will take
advantage of aggregation in the elevator, and improved write
performance is likely.
Notice the ZIO_FLAG_SPECULATIVE flag and pass the hint along to the
elevator by using READA instead of READ. This gives the elevator
the ability to prioritize the real READs ahead of the speculative IO
if needed.
Implement an initial version of vdev_disk_io_done() which, in the
case of an EIO error, triggers a media change check. If it determines
a media change has occurred we fail the device and remove it from the
config. I'm sure this logic can be improved further, but for now it
is an improvement over the VERIFY() that no error will ever happen.
APIs:
2.6.22 API change
Unused destroy_dirty_buffers arg removed from prototype.
2.6.24 API change
Empty write barriers are now supported and we should use them.
2.6.24 API change
Size argument dropped from bio_endio and bi_end_io, because
bi_end_io is now called only once, when the request is complete.
There is no longer any need for a size argument. This also means
that partial IOs are no longer possible and the end_io callback
should not check bi->bi_size. Finally, the return type was updated
to void.
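Under the post-2.6.24 API the completion callback therefore reduces
to something like the following sketch (dio_request_t, its fields,
and vdev_disk_dio_put() are illustrative names):

```c
/* Called exactly once, when the whole request is complete; no size
 * argument, void return, and no bi_size check for partial IO. */
static void
vdev_disk_physio_completion(struct bio *bio, int error)
{
        dio_request_t *dr = bio->bi_private;    /* illustrative */

        if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
                error = -EIO;

        dr->dr_error = error;           /* illustrative field */
        vdev_disk_dio_put(dr);          /* drop this bio's reference */
}
```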
2.6.28 API change
open/close_bdev_excl() renamed to open/close_bdev_exclusive().
2.6.29 API change
BIO_RW_SYNC renamed to BIO_RW_SYNCIO.
Use the legacy BIO_RW_FAILFAST flag if it exists. If it is missing,
it means we are running against a kernel with the newer API. We
should be able to enable some fairly smart behavior once we integrate
with the new API, but until I get around to writing that code just
remove the flag entirely. It's not critical for correctness.
Kernel commit 6712ecf8f648118c3363c142196418f89a510b90, which removes
the size argument from bio_endio and bi_end_io, also removes the need
to handle partial IOs in the handler.