[Cerowrt-devel] Ideas on how to simplify and popularize bufferbloat control for consideration.

Discussion:

Frits Riep

2014-05-20 22:11:50 UTC

The concept of eliminating bufferbloat on many more routers is quite
appealing. Reading some of the recent posts makes it clear there is a
desire to get to a stable code, and also to find a new platform beyond the
current Netgear. However, as good as some of the proposed platforms maybe
for developing and for doing all of the new capabilities of CeroWRT, I also
would like to propose that there also be some focus on reaching a wider and
less sophisticated audience to help broaden the awareness and make control
of bufferbloat more available and easier to attain for more users.

. It appears there is a desire to merge the code into an upcoming
OpenWRT barrier breaker release, which is excellent as it will make it
easier to fight buffer bloat on a wide range of platforms and provide users
with a much easier to install firmware release. I'd like to be able to
download luci-qos-scripts and sqm-scripts and have basic bufferbloat control

Dave Taht

2014-05-20 23:14:03 UTC

Permalink

I agree that reaching more users is important. I disagree we need to reach

Post by Frits Riep
· It appears there is a desire to merge the code into an upcoming
OpenWRT barrier breaker release, which is excellent as it will make it
easier to fight buffer bloat on a wide range of platforms and provide users
with a much easier to install firmware release. I’d like to be able to
download luci-qos-scripts and sqm-scripts and have basic bufferbloat control
on a much greater variety of devices and to many more users.

Frits Riep

2014-05-21 11:42:47 UTC

Permalink

Thanks Dave for your responses. Based on this, it is very good that qos-scripts is available now through openwrt, and as I experienced, it provides a huge advantage for most users. I would agree prioritizing ping is in and of itself not the key goal, but based on what I've read so far, fq-codel provides dramatically better responsiveness for any interactive application such as web-browsing, voip, or gaming, so it qos-scripts would be advantageous for users like your mom if she were in an environment where she had a slow and shared internet connection. Is that a valid interpretation? I am interested in further understanding the differences based on the brief differences you provide. It is true that few devices provide DSCP marking, but if the latency is controlled for all traffic, latency sensitive traffic benefits tremendously even without prioritizing by l7 (layer 7 ?). Is this interpretation also valid?

Yes, your mom wouldn't be a candidate for setting up ceroWRT herself, but if it were set up for her, or if it could be incorporated into a consumer router with automatically determining speed parameters, she would benefit totally from the performance improvement. So the technology ultimately needs to be taken mainstream, and yes that is a huge task.

Frits

-----Original Message-----
From: Dave Taht [mailto:***@gmail.com]
Sent: Tuesday, May 20, 2014 7:14 PM
To: Frits Riep
Cc: cerowrt-***@lists.bufferbloat.net
Subject: Re: [Cerowrt-devel] Ideas on how to simplify and popularize bufferbloat control for consideration.

Post by Frits Riep
The concept of eliminating bufferbloat on many more routers is quite
appealing. Reading some of the recent posts makes it clear there is a
desire to get to a stable code, and also to find a new platform
beyond the current Netgear. However, as good as some of the proposed
platforms maybe for developing and for doing all of the new
capabilities of CeroWRT, I also would like to propose that there also
be some focus on reaching a wider and less sophisticated audience to
help broaden the awareness and make control of bufferbloat more available and easier to attain for more users.
· It appears there is a desire to merge the code into an upcoming
OpenWRT barrier breaker release, which is excellent as it will make it
easier to fight buffer bloat on a wide range of platforms and provide
users with a much easier to install firmware release. I’d like to be
able to download luci-qos-scripts and sqm-scripts and have basic
bufferbloat control on a much greater variety of devices and to many
more users.

d***@reed.com

2014-05-21 14:51:33 UTC

Permalink

Besides deployment in cerowrt and openwrt, what would really have high leverage is that the techniques developed in cerowrt's exploration (including fq_codel) get deployed where they should be deployed: in the access network systems: CMTS's, DSLAM's, Enterprise boundary gear, etc. from the major players.

Cerowrt's fundamental focus has been proving that the techniques really, really work at scale.

However, the fundamental "bloat-induced" experiences are actually occurring due to bloat at points where "fast meets slow". Cerowrt can't really fix the problem in the download direction (currently not so bad because of high download speeds relative to upload speeds in the US - that's in the CMTS's and DSLAM's.

What's depressing to me is that the IETF community spends more time trying to convince themselves that bloat is only a theoretical problem, never encountered in the field. In fact, every lab I've worked at (including the startup accelerator where some of my current company work) has had the network managers complaining to me that a single heavy FTP I'm running causes all of the other users in the site to experience terrible web performance. But when they call Cisco or F5 or whomever, they get told "there's nothing to do but buy complicated flow-based traffic management boxes to stick in line with the traffic (so they can "slow me down").

Bloat is the most common invisible elephant on the Internet. Just fixing a few access points is a start, but even if we fix all the access points so that uploads interfere less, there's still more impact this one thing can have.

So, by all means get this stuff into mainstream, but it's time to start pushing on the access network technology companies (and there are now open switches from Cumulus and even Arista to hack)

Post by Frits Riep
Thanks Dave for your responses. Based on this, it is very good that qos-scripts
is available now through openwrt, and as I experienced, it provides a huge
advantage for most users. I would agree prioritizing ping is in and of itself not
the key goal, but based on what I've read so far, fq-codel provides dramatically
better responsiveness for any interactive application such as web-browsing, voip,
or gaming, so it qos-scripts would be advantageous for users like your mom if she
were in an environment where she had a slow and shared internet connection. Is
that a valid interpretation? I am interested in further understanding the
differences based on the brief differences you provide. It is true that few
devices provide DSCP marking, but if the latency is controlled for all traffic,
latency sensitive traffic benefits tremendously even without prioritizing by l7
(layer 7 ?). Is this interpretation also valid?
Yes, your mom wouldn't be a candidate for setting up ceroWRT herself, but if it
were set up for her, or if it could be incorporated into a consumer router with
automatically determining speed parameters, she would benefit totally from the
performance improvement. So the technology ultimately needs to be taken
mainstream, and yes that is a huge task.
Frits
-----Original Message-----
Sent: Tuesday, May 20, 2014 7:14 PM
To: Frits Riep
Subject: Re: [Cerowrt-devel] Ideas on how to simplify and popularize bufferbloat
control for consideration.

easier to attain for more users.
I agree that reaching more users is important. I disagree we need to reach them

Post by Frits Riep
Â· It appears there is a desire to merge the code into an

upcoming

Post by Frits Riep
OpenWRT barrier breaker release, which is excellent as it will make it
easier to fight buffer bloat on a wide range of platforms and provide
users with a much easier to install firmware release. Iâd like to be
able to download luci-qos-scripts and sqm-scripts and have basic
bufferbloat control on a much grea

Dave Taht

2014-05-21 15:19:55 UTC

Permalink

Post by d***@reed.com
Besides deployment in cerowrt and openwrt, what would really have high
leverage is that the techniques developed in cerowrt's exploration
(including fq_codel) get deployed where they should be deployed: in the
access network systems: CMTS's, DSLAM's, Enterprise boundary gear, etc. from
the major players.

+10.

Post by d***@reed.com
Cerowrt's fundamental focus has been proving that the techniques really,
really work at scale.

That they even work on a processor designed in 1990! :)

I also have hoped that along the way we've shown what techniques don't
work...

Post by d***@reed.com
However, the fundamental "bloat-induced" experiences are actually occurring
due to bloat at points where "fast meets slow". Cerowrt can't really fix
the problem in the download direction (currently not so bad because of high
download speeds relative to upload speeds in the US - that's in the CMTS's
and DSLAM's.

Well, I disagree somewhat. The downstream shaper we use works quite
well, until we run out of cpu at 50mbits. Testing on the ubnt edgerouter
has had the inbound shaper work up a little past 100mbits. So there is
no need (theoretically) to upgrade the big fat head ends if your cpe is
powerful enough to do the job. It would be better if the head ends did it,
of course....

Post by d***@reed.com
What's depressing to me is that the IETF community spends more time trying
to convince themselves that bloat is only a theoretical problem, never
encountered in the field. In fact, every lab I've worked at (including the

It isn't all the IETF. Certainly google gets it and has made huge strides.
reduced RTT = money.

My own frustration comes from papers that are testing this stuff at 4mbit
or lower and not seeing the results we get above that, on everything.

https://plus.google.com/u/0/107942175615993706558/posts/AbeHRY9vzLR

ns2 and ns3 could use some improvements...

Post by d***@reed.com
startup accelerator where some of my current company work) has had the
network managers complaining to me that a single heavy FTP I'm running
causes all of the other users in the site to experience terrible web
performance. But when they call Cisco or F5 or whomever, they get told
"there's nothing to do but buy complicated flow-based traffic management
boxes to stick in line with the traffic (so they can "slow me down").

It is sad that F5, in particular, doesn't have a sane solution. Their whole
approach is to have a "load-balancer" and fq_codel is a load-balancer to
end all load balancers.

I do note nobody I know has ported BQL or fq_codel to bsd (codel is in bsd now)

Post by d***@reed.com
Bloat is the most common invisible elephant on the Internet. Just fixing a

+10.

Post by d***@reed.com
few access points is a start, but even if we fix all the access points so
that uploads interfere less, there's still more impact this one thing can
have.

I was scared silly at the implications 2 years back, I am more sanguine
now.

Post by d***@reed.com
So, by all means get this stuff into mainstream, but it's time to start
pushing on the access network technology companies (and there are now open
switches from Cumulus and even Arista to hack)

Oh, cool! I keep waiting for my parallella to show up so I could start
fiddling with ethernet in the fpga....

Post by d***@reed.com

Thanks Dave for your responses. Based on this, it is very good that
qos-scripts
is available now through openwrt, and as I experienced, it provides a huge
advantage for most users. I would agree prioritizing ping is in and of
itself not
the key goal, but based on what I've read so far, fq-codel provides dramatically
better responsiveness for any interactive application such as
web-browsing, voip,
or gaming, so it qos-scripts would be advantageous for users like your mom if she
were in an environment where she had a slow and shared internet
connection. Is
that a valid interpretation? I am interested in further understanding the
differences based on the brief differences you provide. It is true that
few
devices provide DSCP marking, but if the latency is controlled for all traffic,
latency sensitive traffic benefits tremendously even without prioritizing by l7
(layer 7 ?). Is this interpretation also valid?
Yes, your mom wouldn't be a candidate for setting up ceroWRT herself, but if it
were set up for her, or if it could be incorporated into a consumer router with
automatically determining speed parameters, she would benefit totally from the
performance improvement. So the technology ultimately needs to be taken
mainstream, and yes that is a huge task.
Frits
-----Original Message-----
Sent: Tuesday, May 20, 2014 7:14 PM
To: Frits Riep
Subject: Re: [Cerowrt-devel] Ideas on how to simplify and popularize bufferbloat
control for consideration.

easier to attain for more users.
I agree that reaching more users is important. I disagree we need to reach them

Post by Frits Riep
· It appears there is a desire to merge the code into an

upcoming

Post by Frits Riep
OpenWRT barrier breaker release, which is excellent as it will make it
easier to fight buffer bloat on a wide range of platforms and provide
users with a much easier to install firmware release. I’d like to be
able to download luci-qos-scripts and sqm-scripts and have basic
bufferbloat control on a much greater variety of devices and to many
more users.

d***@reed.com

2014-05-21 16:03:08 UTC

Permalink

Post by Dave Taht
Well, I disagree somewhat. The downstream shaper we use works quite
well, until we run out of cpu at 50mbits. Testing on the ubnt edgerouter
has had the inbound shaper work up a little past 100mbits. So there is
no need (theoretically) to upgrade the big fat head ends if your cpe is
powerful enough to do the job. It would be better if the head ends did it,
of course....

There is an advantage for the head-ends doing it, to the extent that each edge device has no clarity about what is happening with all the other cpe that are sharing that head-end. When there is bloat in the head-end even if all cpe's sharing an upward path are shaping themselves to the "up to" speed the provider sells, they can go into serious congestion if the head-end queues can grow to 1 second or more of sustained queueing delay. My understanding is that head-end queues have more than that. They certainly do in LTE access networks.

Dave Taht

2014-05-21 16:30:12 UTC

Permalink

Post by d***@reed.com

There is an advantage for the head-ends doing it, to the extent that each
edge device has no clarity about what is happening with all the other cpe
that are sharing that head-end. When there is bloat in the head-end even if
all cpe's sharing an upward path are shaping themselves to the "up to" speed
the provider sells, they can go into serious congestion if the head-end
queues can grow to 1 second or more of sustained queueing delay. My
understanding is that head-end queues have more than that. They certainly
do in LTE access networks.

Compelling argument! I agree it would be best for the devices that have the
most information about the network to manage themselves better.

It is deeply ironic to me that I'm arguing for an e2e approach on fixing
the problem in the field, with you!

http://en.wikipedia.org/wiki/End-to-end_principle

--
Dave Täht

NSFW: https://w2.eff.org/Censorship/Internet_censorship_bills/russell_0296_indecent.article

d***@reed.com

2014-05-21 17:55:07 UTC

Permalink

The end-to-end argument against putting functionality in the network is a modularity principle, as you know. The exception is when there is a function that you want to provide that is not strictly end-to-end.

Congestion is one of them - there is a fundamental issue with congestion that it happens because of collective actions among independent actors.

So if you want to achieve the goals of the modularity principle, you need to find either a) the minimal sensing and response you can put in the network that allows the independent actors to "cooperate", or b) require the independent actors to discover and communicate amongst each other individually.

Any solution that tries to satisfy the modularity principle has the property that it provides sufficient information, in a sufficiently timely manner, for the independent actors to respond "cooperatively" to resolve the issue (by reducing their transmission volume in some - presumably approximately fair - way).

Sufficiently timely is bounded by the "draining time" of a switch's outbound link's queue. For practical applications of the Internet today, the "draining time" should never exceed about 30-50 msec., at the outbound link's rate. However, the optimal normal depth of the queue should be no larger than the size needed to keep the outbound link continuously busy at its peak rate whatever that is (for a shared WiFi access point the peak rate is highly variable as you know).

This suggests that the minimal function the network must provide to the endpoints is the packet's "instantaneous" contribution to the draining time of the most degraded link on the path.

Given this information, a pair of endpoints know what to do. If it is a receiver-managed windowed protocol like TCP, the window needs to be adjusted to minimize the contribution to the "draining time" of the currently bottlenecked node, to stop pipelined flows from its sender as quickly as possible.

In that case, cooperative behavior is implicit. The bottleneck switch needs only to inform all independent flows of their contribution, and with an appropriate control loop on each node, approximate fairness can result.

And this is the most general approach. Switches have no idea of the "meaning" of the flows, so beyond timely and accurate reporting, they can't make useful decisions about fixing congestion.

Note that this all is an argument about architectural principles and the essence of the congestion problem.

I could quibble about whether fq_codel is the simplest or best choice for the minimal functionality an "internetwork" could provide. But it's pretty nice and simple. Not clear it works for a decentralized protocol like WiFi as a link - but something like it would seem to be the right thing.

Post by Dave Taht

Post by d***@reed.com

it,

Post by d***@reed.com

Post by Dave Taht
of course....

There is an advantage for the head-ends doing it, to the extent that each
edge device has no clarity about what is happening with all the other cpe
that are sharing that head-end. When there is bloat in the head-end even if
all cpe's sharing an upward path are shaping themselves to the "up to" speed
the provider sells, they can go into serious congestion if the head-end
queues can grow to 1 second or more of sustained queueing delay. My
understanding is that head-end queues have more than that. They certainly
do in LTE access networks.

Compelling argument! I agree it would be best for the devices that have the
most information about the network to manage themselves better.
It is deeply ironic to me that I'm arguing for an e2e approach on fixing
the problem in the field, with you!
http://en.wikipedia.org/wiki/End-to-end_principle
--
Dave TÃ€ht
https://w2.eff.org/Censorship/Internet_censorship_bills/russell_0296_indecent.article

Jim Gettys

2014-05-21 17:47:06 UTC

Permalink

Post by d***@reed.com

it,

Post by Dave Taht
of course....

There is an advantage for the head-ends doing it, to the extent that each
edge device has no clarity about what is happening with all the other cpe
that are sharing that head-end. When there is bloat in the head-end even if
all cpe's sharing an upward path are shaping themselves to the "up to"
speed the provider sells, they can go into serious congestion if the
head-end queues can grow to 1 second or more of sustained queueing delay.
My understanding is that head-end queues have more than that. They
certainly do in LTE access networks.

âI have measured 200ms on a 28Mbps LTE quadrant to a single station. This
was using the simplest possible test on an idle cell. Easy to see how that
can grow to the second range.

Similarly, Dave Taht and I took data recently that showed a large
downstream buffer at the CMTS end (line card?), IIRC, it was something like
.25 megabyte, using a UDP flooding tool.

As always, there may be multiple different buffers lurking in these complex
devices, which may only come into play when different parts of them
"bottleneck", just as we found many different buffering locations inside of
Linux. In fact, some of these devices include Linux boxes (though I do not
know if they are on the packet forwarding path or not).

Bandwidth shaping downstream of those bottlenecks can help, but only to a
degree, and I believe primarily for "well behaved" long lived elephant
flows. Offload engines on servers and coalescing acks in various equipment
makes the degree of help, particularly for transient behavior such as
opening a bunch of TCP connections simultaneously and downloading the
elements of a web page I believe are likely to put large bursts of packets
into these queues, causing transient poor latency. I think we'll get a bit
of help out of the packet pacing code that recently went into Linux (for
well behaved servers) as it deploys. Thanks to Eric Dumazet for that work!
Ironically, servers get updated much more frequently than these middle
boxes, as far as I can tell.

Somehow we gotta get the bottlenecks in these devices (broadband &
cellular) to behave better.
- Jim

Post by d***@reed.com
_______________________________________________
Cerowrt-devel mailing list
https://lists.bufferbloat.net/listinfo/cerowrt-devel

Dave Taht

2014-05-21 17:53:29 UTC

Permalink

Post by Jim Gettys

Post by d***@reed.com

There is an advantage for the head-ends doing it, to the extent that each
edge device has no clarity about what is happening with all the other cpe
that are sharing that head-end. When there is bloat in the head-end even if
all cpe's sharing an upward path are shaping themselves to the "up to" speed
the provider sells, they can go into serious congestion if the head-end
queues can grow to 1 second or more of sustained queueing delay. My
understanding is that head-end queues have more than that. They certainly
do in LTE access networks.

I have measured 200ms on a 28Mbps LTE quadrant to a single station. This
was using the simplest possible test on an idle cell. Easy to see how that
can grow to the second range.
Similarly, Dave Taht and I took data recently that showed a large downstream
buffer at the CMTS end (line card?), IIRC, it was something like .25
megabyte, using a UDP flooding tool.

No it was twice that. The udpburst tool is coming along nicely, but still
needs some analytics against the departure rate to get it right.

Post by Jim Gettys
As always, there may be multiple different buffers lurking in these complex
devices, which may only come into play when different parts of them
"bottleneck", just as we found many different buffering locations inside of
Linux. In fact, some of these devices include Linux boxes (though I do not
know if they are on the packet forwarding path or not).
Bandwidth shaping downstream of those bottlenecks can help, but only to a
degree, and I believe primarily for "well behaved" long lived elephant
flows. Offload engines on servers and coalescing acks in various equipment
makes the degree of help, particularly for transient behavior such as
opening a bunch of TCP connections simultaneously and downloading the
elements of a web page I believe are likely to put large bursts of packets
into these queues, causing transient poor latency. I think we'll get a bit
of help out of the packet pacing code that recently went into Linux (for
well behaved servers) as it deploys. Thanks to Eric Dumazet for that work!
Ironically, servers get updated much more frequently than these middle
boxes, as far as I can tell.
Somehow we gotta get the bottlenecks in these devices (broadband & cellular)
to behave better.

Or we can take a break, and write books about how we learned to relax and
stop worrying about the bloat.

Post by Jim Gettys
- Jim

Post by d***@reed.com
_______________________________________________
Cerowrt-devel mailing list
https://lists.bufferbloat.net/listinfo/cerowrt-devel

--
Dave Täht

NSFW: https://w2.eff.org/Censorship/Internet_censorship_bills/russell_0296_indecent.article

d***@reed.com

2014-05-21 17:56:37 UTC

Permalink

Post by Dave Taht
Or we can take a break, and write books about how we learned to relax and
stop worrying about the bloat.

Leading to waistline bloat?

Jim Gettys

2014-05-21 17:57:57 UTC

Permalink

Post by d***@reed.com

Post by Dave Taht
Or we can take a break, and write books about how we learned to relax and
stop worrying about the bloat.

Leading to waistline bloat?

âWe resemble that remark already....
â

Dave Taht

2014-05-21 18:31:39 UTC

Permalink

Post by Jim Gettys

Post by d***@reed.com

Post by Dave Taht
Or we can take a break, and write books about how we learned to relax and
stop worrying about the bloat.

Leading to waistline bloat?

We resemble that remark already....

I put on 35 pounds since starting to work on this.

--
Dave Täht

NSFW: https://w2.eff.org/Censorship/Internet_censorship_bills/russell_0296_indecent.article

Dave Taht

2014-05-21 15:07:37 UTC

Permalink

I should point out that another issue with deploying fq_codel widely
is that it requires an accurate
measurement (currently) of the providers bandwidth.

My hope/expectation is that more ISPs that
provide CPE will ship something that is configured correctly by
default, following in free.fr's footsteps,
and trying to beat the cable industry to the punch, now that the core
code is debugged and documented, creating an out-of-box win.

Post by Frits Riep
I would agree prioritizing ping is in and of itself not the key goal, but based on what I've read so far, fq-codel provides dramatically better responsiveness for any interactive application such as web-browsing, voip, or gaming, so it qos-scripts would be advantageous for users like your mom if she were in an environment where she had a slow and shared internet connection. Is that a valid interpretation?

Sure. My mom has a fast, non-shared internet connection. Her biggest
problem is she hasn't
got off of windows despite my brother's decade of attempts to move her
to osx.... :)

Markets where this stuff seriously applies as a rate limiter + qos system
today are small to medium business, cybercafes, shared workspaces,
geek-zones, and so on. It also applies on ethernet and in cases where
you want to artificially have a rate limit like:

http://pieknywidok.blogspot.com/2014/05/10g-1g.html

We're ~5 years ahead of the curve here at cerowrt-central. Tools "just
work" for any sysadmin with chops. Commercial products are in the
pipeline.

While it takes time to build it into a product, I'd kind of expect
barracuda and ubnt to add fq_codel
to their products fairly soon, and for at least one switch vendor to follow.

It's in shorewall, ipfire, streamboost, everything downstream from openwrt,
linux mainline (and thus every linux distro) already. I know of a
couple cloud providers that are running
sch_fq and fq_codel already.

One thing I'm a little frustrated about, is that I'd expected sch_fq
to replace pfifo_fast by default
on more linux distros by now. It's a single sysctl...

Post by Frits Riep
I am interested in further understanding the differences based on the brief differences you provide. It is true that few devices provide DSCP marking, but if the latency is controlled for all traffic, latency sensitive traffic benefits tremendously even without prioritizing by l7 (layer 7 ?). Is this interpretation also valid?

Very, very true. Most of the need for prioritization goes away
entirely, due to the "sparse" vs "full" (or fast vs slow) queue
concept in fq_codel. In most circumstances things like voip just cut
through other traffic like butter. Videoconferencing is vastly
improved, also.

However, on very, very slow links (<3mbit), nothing helps enough. It's
not just the qos system that needs to be tuned, but that modern TCPs
and the web are optimized for much faster links and have features that
hurt at low speeds. (what helps most is installing adblock plus!).
Torrent is something of a special case - I find it totally bearable at
20mbit/4mbit without classification - but unbearable at 8/1.

I'm pretty satisfied we have the core algorithms and theory in place,
now, to build edge devices that work much better at 3mbit to 200mbit,
at least, possibly 10gbit or higher.

Post by Frits Riep
Yes, your mom wouldn't be a candidate for setting up ceroWRT herself, but if it were set up for her, or if it could be incorporated into a consumer router with automatically determining speed parameters,

That automatic speedtest thing turns out to be hard.

Post by Frits Riep
she would benefit totally from the performance improvement.

Meh. She needs to get off of windows.

Post by Frits Riep
So the technology ultimately needs to be taken mainstream, and yes that is a huge task.

Yep. If we hadn't given away everything perhaps there would be a
business model to fund that - streamboost is trying that route.

My hope was that the technology is merely so compelling that vendors
would be falling over themselves to answer the customer complaints.
But few have tied "bufferbloat" to the problems gamers and small
business are having with their internet uplinks as yet and more
education and demonstration seems necessary.

There is a huge backlog of potential demand for a better dslam, in
particular, as well as better firewalls and cablemodems. I don't have
a lot of hope for the two CMTS vendors to move to improve things
anytime soon.

Post by Frits Riep
Frits
-----Original Message-----
Sent: Tuesday, May 20, 2014 7:14 PM
To: Frits Riep
Subject: Re: [Cerowrt-devel] Ideas on how to simplify and popularize bufferbloat control for consideration.

Michael Richardson

2014-05-21 16:50:46 UTC

Permalink

Post by Dave Taht
I should point out that another issue with deploying fq_codel widely
is that it requires an accurate
measurement (currently) of the providers bandwidth.

I've been thinking about ways to do this over PPP(oE) links if one controls
both ends --- many third party internet access ISPs terminate the PPP
on their equipment, rather than the telco's, so it should be possible
to avoid all the L2 issues.

My ISP now offers fiber-to-the-neighbourhood, 50Mb/s down, 10 up.
(vs 7/640 that I have now). They are offering me an
http://smartrg.com/products/products/sr505n/

which they suggest I run in bridge (layer-2) mode. I'm trying to figure out
what is inside, as it has the DSL interface right on it. I didn't know
of this device before.

Post by Dave Taht
My hope/expectation is that more ISPs that
provide CPE will ship something that is configured correctly by
default, following in free.fr's footsteps,
and trying to beat the cable industry to the punch, now that the core
code is debugged and documented, creating an out-of-box win.

Agreed.

--
] Never tell me the odds! | ipv6 mesh networks [
] Michael Richardson, Sandelman Software Works | network architect [
] ***@sandelman.ca http://www.sandelman.ca/ | ruby on rails [

David Lang

2014-05-21 17:58:57 UTC

Permalink

I should point out that another issue with deploying fq_codel widely is that
it requires an accurate measurement (currently) of the providers bandwidth.

does it need this accurate measurement for sending or for the recieving pacing?

David Lang

2014-05-24 14:12:56 UTC

Permalink

I should point out that another issue with deploying fq_codel widely is that it requires an accurate measurement (currently) of the providers bandwidth.

Pardon my noobiness, but is there a technical obstacle that prevents
the creation of a user-triggered function on the router side that
measures the provider's bandwidth?

Function, when (luci-gui?) triggered, would:

1. Ensure that internet connectivity is present.
2. Disconnect all clients.
3. Engage in DL and UL on a dedicated web server, measure stats and
straight up use them in fq_codel -- or suggest them in appropriate
QoS-gui user-boxes.

Further, this function could be auto-scheduled or made enabled on
router boot up.

I must be missing something important which prevents this. What is it?

Sebastian Moeller

2014-05-24 17:31:47 UTC

Permalink

Hi R, hi List,

Post by R.

I should point out that another issue with deploying fq_codel widely is that it requires an accurate measurement (currently) of the providers bandwidth.

Pardon my noobiness, but is there a technical obstacle that prevents
the creation of a user-triggered function on the router side that
measures the provider's bandwidth?
1. Ensure that internet connectivity is present.
2. Disconnect all clients.
3. Engage in DL and UL on a dedicated web server, measure stats and
straight up use them in fq_codel -- or suggest them in appropriate
QoS-gui user-boxes.
Further, this function could be auto-scheduled or made enabled on
router boot up.
I must be missing something important which prevents this. What is it?

Well, I see a couple of challenges that need to be overcome before this could work.

In your step 3 you touch the issue of measuring the current stats; and somehow what is trickier than one would think:

1) what to measure precisely, a "dedicated web server" sounds like a great idea, but who is dedicating it and where is it located relative to the link under test?
Rich Brown has made a nice script to measure current throughput and give an estimate on the effect of link saturation on latency (see betterspeedtest.sh from https://github.com/richb-hanover/CeroWrtScripts), but using this from Germany gives:
2014-05-24 15:44:47 Testing against demo.tohojo.dk with 5 simultaneous sessions while pinging gstatic.com (60 seconds in each direction)
Download: 12.06 Mbps
Upload: 1.99 Mbps
against a server in Europe, but:
Download: 10.42 Mbps
Upload: 1.85 Mbps
against a server on the east side of the USA. So the router would need to select a close-by server. Sites as speedtest.net offer this kind of server selection by proximity but do not have a very reliable way to load the link and do not measure the effect of link saturation on the latency… but the whole idea is to find the highest bandwidth that foes not cause indecent increase of latency under load. (Also speed tests are quite stereotypic in observable behavior and length so some ISPs special case these to look good; but that is a different kettle of fish…)
Note that there is also the question where one would like to measure the linkspeed; for example for DSL there is the link to the DSLAM, the link from the DSLAM to the next network node, sometimes a PPP link to a remote BRAS system (that might throttle the traffic). All of these can be the bottlenecks of the ISP connection (depending on circumstances). My take is that one would like to look at the link between modem and DSLAM as the bottleneck, but the opinions differ (and then there is cable with its shared first segment...).

2) Some links have quite peculiar properties that are hard to deduce from quick speed tests. For example ATM based ADSL links (this includes all ADSL1, ADSL2 and to my knowledge all existing ADSL2+ links) will show a packetize dependent link speed. In short ATM uses an integer number of 48 byte cells to transport each packet, so worst case it adds 47 bytes to the payload for small packet that can effectively double the size of the packet on the wire, or stared differently half the link speed for packets of that size. (Note thanks to the work of Jesper Brouer and Russel Stuart the linux kernel can take care of that issue for you, but you need to tell the kernel explicitly.)

3) many links actually do not have a constant wire speed available. For docsis (basically cable) the local segment is shared between many users and transmit timeslots are shared between requestors, giving effectively slower links during peak hours. For DSL a resync between DSLAM and modem can (significantly) change the negotiated speed; something cerowrt does not get any notice of…

I guess buffer bloat mitigation needs to move into the modems and DSLAMs to get rid of the bandwidth guessing game. For cable at least the modems are getting better (thanks to PIE being part of the docsis 3.1? standard), but for DSL I do not think there is any generic solution on the horizon…

Best Regards
Sebastian

Post by R.
_______________________________________________
Cerowrt-devel mailing list
https://lists.bufferbloat.net/listinfo/cerowrt-devel

David P. Reed

2014-05-24 19:05:46 UTC

Permalink

Depends on the type of the provider. Most providers now have shared paths to the backbone among users and give a peak rate up and down for brief periods that they will not sustain... In fact they usually penalize use of the peak rate by reducing the rate after that.

So at what point they create bloat in their access net is hard to determine. And it depends on your neighbors' behavior as well.

The number you want is the bloatedness of your path through the access provider.

This is measurable by sending small probes back and forth to a measurement server... Measuring instantaneous latency in each direction and combining that information with one's recent history in a non trivial calculation.

Note that that measurement does not directly produce provider speeds that can be input to the shapers used in codel. But it does produce a queue size that can.

So it's a plausible way to proceed as long as the operators refuse to fix their gear to manage the actual link that is problematic.

Personally I'd suggest that the gear makers' feet be held to the fire... by not "fixing" it by an inferior fix at the home router. Keep the pressure on them at IETF and among their customers.

Post by Dave Taht
I should point out that another issue with deploying fq_codel widely

is that it requires an accurate measurement (currently) of the
providers bandwidth.
Pardon my noobiness, but is there a technical obstacle that prevents
the creation of a user-triggered function on the router side that
measures the provider's bandwidth?
1. Ensure that internet connectivity is present.
2. Disconnect all clients.
3. Engage in DL and UL on a dedicated web server, measure stats and
straight up use them in fq_codel -- or suggest them in appropriate
QoS-gui user-boxes.
Further, this function could be auto-scheduled or made enabled on
router boot up.
I must be missing something important which prevents this. What is it?
_______________________________________________
Cerowrt-devel mailing list
https://lists.bufferbloat.net/listinfo/cerowrt-devel

-- Sent from my Android device with K-@ Mail. Please excuse my brevity.

2014-05-24 14:03:18 UTC

Permalink

I should point out that another issue with deploying fq_codel widely is that it requires an accurate measurement (currently) of the providers bandwidth.

V***@vt.edu

2014-07-25 18:37:34 UTC

Permalink

Post by R.
Further, this function could be auto-scheduled or made enabled on
router boot up.

Yeah, if such a thing worked, it would be good.

(Note in the following that a big part of my *JOB* is doing "What could
possibly go wrong?" analysis on mission-critical systems, which tends to color
my viewpoint on projects. I still think the basic concept is good, just
difficult to do, and am listing the obvious challenges for anybody brave
enough to tackle it... :)

Post by R.
I must be missing something important which prevents this. What is it?

There's a few biggies. The first is what the linux-kernel calls -ENOPATCH -
nobody's written the code. The second is you need an upstream target someplace
to test against. You need to deal with both the "server is unavalailable due
to a backhoe incident 2 time zones away" problem (which isn't *that* hard, just
default to Something Not Obviously Bad(TM), and "server is slashdotted" (whci
is a bit harder to deal with. Remember that there's some really odd corner
cases to worry about - for instance, if there's a power failure in a town, then
when the electric company restores power you're going to have every cerowrt box
hit the server within a few seconds - all over the same uplink most likely. No
good data can result from that... (Holy crap, it's been almost 3 decades since
I first saw a Sun 3/280 server tank because 12 Sun 3/50s all rebooted over the
network at once when building power was restored).

And if you're in Izbekistan and the closest server netwise is at 60 Hudson, the
analysis to compute the correct values becomes.... interesting.

Dealing with non-obvious error conditions is also a challenge - a router
may only boot once every few months. And if you happen to be booting just
as a BGP routing flap is causing your traffic to take a vastly suboptimal
path, you may end up encoding a vastly inaccurate setting and have it stuck
there, causing suckage for non-obvious reasons for the non-technical, so you
really don't want to enable auto-tuning unless you also have a good plan for
auto-*RE*tuning....

David Lang

2014-07-25 21:03:38 UTC

Permalink

Post by V***@vt.edu

Post by R.
Further, this function could be auto-scheduled or made enabled on
router boot up.

Yeah, if such a thing worked, it would be good.
(Note in the following that a big part of my *JOB* is doing "What could
possibly go wrong?" analysis on mission-critical systems, which tends to color
my viewpoint on projects. I still think the basic concept is good, just
difficult to do, and am listing the obvious challenges for anybody brave
enough to tackle it... :)

Post by R.
I must be missing something important which prevents this. What is it?

There's a few biggies. The first is what the linux-kernel calls -ENOPATCH -
nobody's written the code. The second is you need an upstream target someplace
to test against. You need to deal with both the "server is
unavalailable due
to a backhoe incident 2 time zones away" problem (which isn't *that* hard, just
default to Something Not Obviously Bad(TM), and "server is
slashdotted" (whci
is a bit harder to deal with. Remember that there's some really odd corner
cases to worry about - for instance, if there's a power failure in a town, then
when the electric company restores power you're going to have every cerowrt box
hit the server within a few seconds - all over the same uplink most likely. No
good data can result from that... (Holy crap, it's been almost 3 decades since
I first saw a Sun 3/280 server tank because 12 Sun 3/50s all rebooted over the
network at once when building power was restored).
And if you're in Izbekistan and the closest server netwise is at 60 Hudson, the
analysis to compute the correct values becomes.... interesting.
Dealing with non-obvious error conditions is also a challenge - a router
may only boot once every few months. And if you happen to be booting just
as a BGP routing flap is causing your traffic to take a vastly
suboptimal
path, you may end up encoding a vastly inaccurate setting and have it stuck
there, causing suckage for non-obvious reasons for the non-technical, so you
really don't want to enable auto-tuning unless you also have a good plan for
auto-*RE*tuning....

have the router record it's finding, and then repeat the test
periodically, recording it's finding as well. If the new finding is
substantially different from the prior ones, schedule a retest 'soon'
(or default to the prior setting if it's bad enough), otherwise, if
there aren't many samples, schedule a test 'soon' if there are a lot of
samples, schedule a test in a while.

However, I think the big question is how much the tuning is required.

If a connection with BQL and fq_codel is 90% as good as a tuned setup,
default to untuned unless the user explicitly hits a button to measure
(and then a second button to accept the measurement)

If BQL and fw_codel by default are M70% as good as a tuned setup,
there's more space to argue that all setups must be tuned, but then the
question is how to they fare against a old, non-BQL, non-fq-codel setup?
if they are considerably better, it may still be worthwhile.

David Lang

Sebastian Moeller

2014-07-26 11:30:08 UTC

Permalink

Hi David,

Post by V***@vt.edu

Post by R.
Further, this function could be auto-scheduled or made enabled on
router boot up.

Yeah, if such a thing worked, it would be good.
(Note in the following that a big part of my *JOB* is doing "What could
possibly go wrong?" analysis on mission-critical systems, which tends to color
my viewpoint on projects. I still think the basic concept is good, just
difficult to do, and am listing the obvious challenges for anybody brave
enough to tackle it... :)

Post by R.
I must be missing something important which prevents this. What is it?

There's a few biggies. The first is what the linux-kernel calls -ENOPATCH -
nobody's written the code. The second is you need an upstream target someplace
to test against. You need to deal with both the "server is unavalailable due
to a backhoe incident 2 time zones away" problem (which isn't *that* hard, just
default to Something Not Obviously Bad(TM), and "server is slashdotted" (whci
is a bit harder to deal with. Remember that there's some really odd corner
cases to worry about - for instance, if there's a power failure in a town, then
when the electric company restores power you're going to have every cerowrt box
hit the server within a few seconds - all over the same uplink most likely. No
good data can result from that... (Holy crap, it's been almost 3 decades since
I first saw a Sun 3/280 server tank because 12 Sun 3/50s all rebooted over the
network at once when building power was restored).
And if you're in Izbekistan and the closest server netwise is at 60 Hudson, the
analysis to compute the correct values becomes.... interesting.
Dealing with non-obvious error conditions is also a challenge - a router
may only boot once every few months. And if you happen to be booting just
as a BGP routing flap is causing your traffic to take a vastly suboptimal
path, you may end up encoding a vastly inaccurate setting and have it stuck
there, causing suckage for non-obvious reasons for the non-technical, so you
really don't want to enable auto-tuning unless you also have a good plan for
auto-*RE*tuning....

Yeah, keeping some history to “predict” when to measure next sounds clever.

However, I think the big question is how much the tuning is required.

I assume in most cases you need to measure the home-routers bandwidth rarely (say on DSL only after a re-sync with the DSLAM), but you need to measure the bandwidth early as only then you can properly shape the downlink. And we need to know the link’s capacity to use traffic shaping so that BQL and fq_codel in the router have control over the bottleneck queue… An equivalent of BQL and fq_codel running in the DSLAM/CMTS and CPE obviously would be what we need, because then BQL and fq_codel on the router would be all that is required. But that does not seem like it is happening anytime soon, so we still need to workaround the limitations in the equipment fr a long time to come, I fear.

If a connection with BQL and fq_codel is 90% as good as a tuned setup, default to untuned unless the user explicitly hits a button to measure (and then a second button to accept the measurement)
If BQL and fw_codel by default are M70% as good as a tuned setup, there's more space to argue that all setups must be tuned, but then the question is how to they fare against a old, non-BQL, non-fq-codel setup? if they are considerably better, it may still be worthwhile.

Best Regards
Sebastian

David Lang
_______________________________________________
Cerowrt-devel mailing list
https://lists.bufferbloat.net/listinfo/cerowrt-devel

David Lang

2014-07-26 20:39:59 UTC

Permalink

Post by Sebastian Moeller
Hi David,

Post by V***@vt.edu

Post by R.
Further, this function could be auto-scheduled or made enabled on
router boot up.

Yeah, if such a thing worked, it would be good.
(Note in the following that a big part of my *JOB* is doing "What could
possibly go wrong?" analysis on mission-critical systems, which tends to color
my viewpoint on projects. I still think the basic concept is good, just
difficult to do, and am listing the obvious challenges for anybody brave
enough to tackle it... :)

Post by R.
I must be missing something important which prevents this. What is it?

There's a few biggies. The first is what the linux-kernel calls -ENOPATCH -
nobody's written the code. The second is you need an upstream target someplace
to test against. You need to deal with both the "server is unavalailable due
to a backhoe incident 2 time zones away" problem (which isn't *that* hard, just
default to Something Not Obviously Bad(TM), and "server is slashdotted" (whci
is a bit harder to deal with. Remember that there's some really odd corner
cases to worry about - for instance, if there's a power failure in a town, then
when the electric company restores power you're going to have every cerowrt box
hit the server within a few seconds - all over the same uplink most likely. No
good data can result from that... (Holy crap, it's been almost 3 decades since
I first saw a Sun 3/280 server tank because 12 Sun 3/50s all rebooted over the
network at once when building power was restored).
And if you're in Izbekistan and the closest server netwise is at 60 Hudson, the
analysis to compute the correct values becomes.... interesting.
Dealing with non-obvious error conditions is also a challenge - a router
may only boot once every few months. And if you happen to be booting just
as a BGP routing flap is causing your traffic to take a vastly suboptimal
path, you may end up encoding a vastly inaccurate setting and have it stuck
there, causing suckage for non-obvious reasons for the non-technical, so you
really don't want to enable auto-tuning unless you also have a good plan for
auto-*RE*tuning....

Yeah, keeping some history to predict when to measure next sounds clever.

However, I think the big question is how much the tuning is required.

I assume in most cases you need to measure the home-routers bandwidth rarely
(say on DSL only after a re-sync with the DSLAM), but you need to measure the
bandwidth early as only then you can properly shape the downlink. And we need
to know the links capacity to use traffic shaping so that BQL and fq_codel in
the router have control over the bottleneck queue An equivalent of BQL and
fq_codel running in the DSLAM/CMTS and CPE obviously would be what we need,
because then BQL and fq_codel on the router would be all that is required. But
that does not seem like it is happening anytime soon, so we still need to
workaround the limitations in the equipment fr a long time to come, I fear.

by how much tuning is required, I wasn't meaning how frequently to tune, but how
close default settings can come to the performance of a expertly tuned setup.

Ideally the tuning takes into account the characteristics of the hardware of the
link layer. If it's IP encapsulated in something else (ATM, PPPoE, VPN, VLAN
tagging, ethernet with jumbo packet support for example), then you have overhead
from the encapsulation that you would ideally take into account when tuning
things.

the question I'm talking about below is how much do you loose compared to the
idea if you ignore this sort of thing and just assume that the wire is dumb and
puts the bits on them as you send them? By dumb I mean don't even allow for
inter-packet gaps, don't measure the bandwidth, don't try to pace inbound
connections by the timing of your acks, etc. Just run BQL and fq_codel and start
the BQL sizes based on the wire speed of your link (Gig-E on the 3800) and
shrink them based on long-term passive observation of the sender.

If you end up only loosing 5-10% of your overall network performance by ignoring
the details of the wire, then we should ignore them by default.

If however, not measuring anything first results in significantly worse
performance than a tuned setup, then we need to figure out how to do the
measurements needed for tuning.

Some people seem to have fallen into the "perfect is the enemy of good enough"
trap on this topic. They are so fixated on getting the absolute best performance
out of a link that they are forgetting how bad the status-quo is right now.

If you look at the graph that Dave Taht put on page 6 of his slide deck
http://snapon.lab.bufferbloat.net/~d/Presos/CaseForComprehensiveQueueManagement/assets/player/KeynoteDHTMLPlayer.html#5
it's important to realize that even the worst of the BQL+fq_codel graphs is
worlds better than the default setting, while it would be nice to get to the
green trace on the left, even getting to the middle traces instead of the black
trace on the right would be a huge win for the public.

David Lang