Fix estimated scrub completion time

Currently, it is possible for the 'zpool scrub' command to
progress slightly beyond 100% due to concurrent changes
happening on the live pool. This behavior is expected, but
the userspace code for 'zpool status' would subtract the
expected amount of data from the amount of data already
scrubbed, resulting in a negative integer being casted to a
large positive one. This number was then used to calculate
the estimated completion time, resulting in wildly wrong
results. This code changes the behavior so that 'zpool status'
does not attempt to report an estimate during this period.

Reviewed by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Igor Kozhukhov <igor@dilos.org>
Reviewed-by: George Melikov <mail@gmelikov.ru>
Signed-off-by: Tom Caputi <tcaputi@datto.com>
Closes #8611 
Closes #8687
This commit is contained in:
Tom Caputi 2019-05-01 20:34:24 -04:00 committed by Brian Behlendorf
parent 6bdefad311
commit 85bdc68401
2 changed files with 7 additions and 2 deletions

View File

@ -7011,7 +7011,7 @@ print_scan_status(pool_scan_stat_t *ps)
scan_rate = pass_scanned / elapsed; scan_rate = pass_scanned / elapsed;
issue_rate = pass_issued / elapsed; issue_rate = pass_issued / elapsed;
total_secs_left = (issue_rate != 0) ? total_secs_left = (issue_rate != 0 && total >= issued) ?
((total - issued) / issue_rate) : UINT64_MAX; ((total - issued) / issue_rate) : UINT64_MAX;
days_left = total_secs_left / 60 / 60 / 24; days_left = total_secs_left / 60 / 60 / 24;
@ -7045,7 +7045,8 @@ print_scan_status(pool_scan_stat_t *ps)
} }
if (pause == 0) { if (pause == 0) {
if (issue_rate >= 10 * 1024 * 1024) { if (total_secs_left != UINT64_MAX &&
issue_rate >= 10 * 1024 * 1024) {
(void) printf(gettext(", %llu days " (void) printf(gettext(", %llu days "
"%02llu:%02llu:%02llu to go\n"), "%02llu:%02llu:%02llu to go\n"),
(u_longlong_t)days_left, (u_longlong_t)hours_left, (u_longlong_t)days_left, (u_longlong_t)hours_left,

View File

@ -2189,6 +2189,10 @@ If a scrub is paused, the
resumes it. resumes it.
If a resilver is in progress, ZFS does not allow a scrub to be started until the If a resilver is in progress, ZFS does not allow a scrub to be started until the
resilver completes. resilver completes.
.Pp
Note that, due to changes in pool data on a live system, it is possible for
scrubs to progress slightly beyond 100% completion. During this period, no
completion time estimate will be provided.
.Bl -tag -width Ds .Bl -tag -width Ds
.It Fl s .It Fl s
Stop scrubbing. Stop scrubbing.