Page MenuHomeFreeBSD

A problem with ASYNC drain
AbandonedPublic

Authored by mmacy on Jul 6 2016, 11:08 AM.
Tags
None
Referenced Files
F81969580: D7135.diff
Sat, Dec 14, 7:45 AM
Unknown Object (File)
Feb 27 2024, 10:26 AM
Unknown Object (File)
Feb 27 2024, 10:13 AM
Unknown Object (File)
Feb 27 2024, 1:59 AM
Unknown Object (File)
Feb 22 2024, 10:00 AM
Unknown Object (File)
Feb 22 2024, 10:00 AM
Unknown Object (File)
Feb 22 2024, 10:00 AM
Unknown Object (File)
Feb 2 2024, 10:22 AM

Details

Reviewers
glebius
jch
rrs
Group Reviewers
transport
Summary

If a thread is running async-drain (like TCP does) its possible some other thread may attempt to restart
the timer. If that occurs the thread should be given a error return and the timer not started (just like
the synchronous drain).

Note that Gleb's previous patch to the callout system fixes 1/2 of the problem this rarer case is still possible and
needs to be addressed :-)

Test Plan

Run it in a netflix work load as well as create a test module to create the exact circumstance.

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

rrs retitled this revision from to A problem with ASYNC drain.
rrs updated this object.
rrs edited the test plan for this revision. (Show Details)
rrs added reviewers: glebius, transport.

Originally I had started working on this before Gleb decided to take over and clean it up. I
had fixed the three issues I saw:

  1. Principle - the start of a new timer was not rejected (like it is for sync drain) for async-drain.
  2. A three way leak in the callout start
  3. An incorrect return on some cases with async_drain and tcp.

I think these were the real issues that Gleb went in to fix when he re-wrote a block of
the code and started breaking thing due to the change in return codes and others.

I decided to go back to this version, which I know works and NF is testing (and will soon release
into production).

If Gleb wants he can hack on his re-write and figure out how to make it work.. or he can use this
here and it will work.. I have no preference either way.. I just know that this code does work :-)

Yes, see rS302894, rS302997, rS302998, rS303037, rS303096, and rS303098.

Short version: @glebius eventually fixed it.

mmacy abandoned this revision.
mmacy added a reviewer: rrs.