[mosh-devel] Mosh on Kuberbnetes

Discussion:

Thomas Buckley-Houston

2018-06-24 04:20:32 UTC

Hello Mosh Developers,

This is my first post to the list. Thanks for the introduction Keith.

I'm the author of Texttop (soon to be re-released as Browsh):
https://github.com/tombh/texttop It is a fully-modern text-based
browser.

You can actually SSH into a demo: `ssh brow.sh` (no auth needed). This
is not a publicised service yet, there are only 3 instances, so don't
share it too much.

Anyway, my problem is that the SSH service is running on Kubernetes
behind a load balancer and of course the load balancer likes to spread
out connections to the instances. Thus, by default, it cannot
guarantee that, when using Mosh, the upgrade from SSH port 22 to Mosh
port 60001 arrives at the same instance.

I won't go into why, but it'll be a significant hurdle to enable
IP-based sticky sessions, so that users are guaranteed the same
instance. So first I just want to cross other approaches off my list.
My first thought is whether anyone has ever wrapped mosh-server? Eg;
How straight forward would it be to have a proxy listening on Mosh
ports to intercept the handshake from mosh-client? That way I could
have my own centralised token management, that once verified the
token, could fire up mosh-server and feed it the incoming connection.

At the very least, I could actually get away with just running
mosh-server without key checking, as this is currently all only for
demo purposes and no actual authentication is needed. I just need
mosh-server to refuse more than 1 connection on the same port, which I
assume it already does? Would it be as simple as patching a single
line to disable key checking?

Many thanks for any ideas or feedback,
Tom

Anders Kaseorg

2018-06-24 05:36:03 UTC

Permalink

You may have a misunderstanding about how a Mosh session is set up. The
mosh script launches a mosh-server on the remote system via SSH;
mosh-server picks a port number and a random encryption key, and writes
them to stdout, where they go back over SSH to the mosh script; then the
mosh script launches mosh-client passing the IP address, port number, and
encryption key. The newly launched mosh-client and mosh-server processes
exchange UDP packets encrypted with the shared key; communication is
successful if the packets can be decrypted.

There’s no separate “key checking” step to be disabled. And it doesn’t
make sense to “refuse more than 1 connection on the same port”, both
because UDP is connectionless, and because a new mosh-server is launched
on a new port for each Mosh session (it is not a daemon like sshd).

The easiest way to put Mosh servers behind a load balancer is with
round-robin DNS where a single hostname resolves to many addresses, or to
different addresses for different clients and/or at different times.
We’ve already gone out of our way to make the mosh script resolve the
hostname only once and use the same address for the SSH connection and the
UDP packets, because that’s needed for MIT’s athena.dialup.mit.edu pool.

If that’s not an option and you really need all connections to go through
a single load balancer address, you could try wrapping mosh-server in a
script that passes different disjoint port ranges (-p) on different
backends, and forwarding those ranges to the corresponding backends from
the load balancer.

Unrelatedly, brow.sh doesn’t resolve with DNSSEC-enabled resolvers like
1.1.1.1 or 8.8.8.8, seemingly due to some problem with the DS records set
with the registrar: https://dnssec-debugger.verisignlabs.com/brow.sh.

Anders

Thomas Buckley-Houston

2018-06-25 10:10:58 UTC

Permalink

Thanks so much for the clarification.

Post by Anders Kaseorg
UDP is connectionless

That's the key here. So I have no choice but to use sticky IP-based
routing. Round-robin DNS isn't an option I don't think, because I hope
one day to be able to scale to thousands of servers.

And thanks so much for the heads up about my DNSSEC records. I've sent
a request for them to be deleted. I'd added them and some SSHFP
records to explore automatically passing the StrictHostKey warning.
But it's not entirely straight forward. Even with correct DNS records
the SSH user still has to have VerifyHostKeyDNS enabled, which as I
understand most people don't. And then on top of that my DNS provider
(DNSSimple) automatically rotate the keys every 3 months, which means
I have to manually send a request to my registrars by email to update
the DNSSEC records. Is it all worth it do you think?

Post by Anders Kaseorg
You may have a misunderstanding about how a Mosh session is set up. The
mosh script launches a mosh-server on the remote system via SSH;
mosh-server picks a port number and a random encryption key, and writes
them to stdout, where they go back over SSH to the mosh script; then the
mosh script launches mosh-client passing the IP address, port number, and
encryption key. The newly launched mosh-client and mosh-server processes
exchange UDP packets encrypted with the shared key; communication is
successful if the packets can be decrypted.
There’s no separate “key checking” step to be disabled. And it doesn’t
make sense to “refuse more than 1 connection on the same port”, both
because UDP is connectionless, and because a new mosh-server is launched
on a new port for each Mosh session (it is not a daemon like sshd).
The easiest way to put Mosh servers behind a load balancer is with
round-robin DNS where a single hostname resolves to many addresses, or to
different addresses for different clients and/or at different times.
We’ve already gone out of our way to make the mosh script resolve the
hostname only once and use the same address for the SSH connection and the
UDP packets, because that’s needed for MIT’s athena.dialup.mit.edu pool.
If that’s not an option and you really need all connections to go through
a single load balancer address, you could try wrapping mosh-server in a
script that passes different disjoint port ranges (-p) on different
backends, and forwarding those ranges to the corresponding backends from
the load balancer.
Unrelatedly, brow.sh doesn’t resolve with DNSSEC-enabled resolvers like
1.1.1.1 or 8.8.8.8, seemingly due to some problem with the DS records set
with the registrar: https://dnssec-debugger.verisignlabs.com/brow.sh.
Anders

john hood

2018-06-25 13:12:21 UTC

Permalink

Post by Thomas Buckley-Houston
Thanks so much for the clarification.

Post by Anders Kaseorg
UDP is connectionless

That's the key here. So I have no choice but to use sticky IP-based
routing. Round-robin DNS isn't an option I don't think, because I hope
one day to be able to scale to thousands of servers.

A simple DNS round-robin may not scale, but CDNs use DNS to load
balance and geolocate traffic all the time. DNS load balancing is not
as immediate as a middlebox, but you are going to be wrangling a
slow-moving load of long-lived user sessions, not HTTP connections.
If you do end up scaling to thousands of servers, you'll need to do
some DNS-based dynamic management anyway.

Since Mosh fundamentally needs to know the same destination for its
SSH and Mosh session, it doesn't mesh well with load balancers or
other technologies that obscure destination addresses.

Also, brow.sh doesn't work for me, without any DNSSEC involved-- it
looks like ns?.dnsimple.com are not returning authoritative answers
for brow.sh (though the IP addresses they give out does work).

regards,

--jh

Keith Winstein

2018-06-26 20:50:48 UTC

Permalink

Hi Thomas,

Glad you could provoke a very interesting discussion! But I'm still
confused -- how is "sticky IP-based routing" going to work after the client
roams to a new IP address (or to a new UDP source port)? When your system
seems an incoming UDP datagram from a previously unseen source IP:port, how
does it know which mosh-server (on which server machine) to send it to?

With off-the-shelf Mosh, you basically need a load-balancing strategy that
allows a destination IP:port to uniquely identify a particular mosh-server.
You can do this with multiple DNS A/AAAA records (where the client picks
the winning one -- maybe you permute the list), or with a smart DNS server
that serves *one* A or AAAA record to the client at the time of resolution
(like a CDN would use).

Instead of using the mosh wrapper script, you could have your users use
some other scheme to figure out the IP:port of the server, but the point is
that once you launch the mosh-client, it's going to keep sending datagrams
to the IP:port of the mosh-server, and those datagrams need to get to the
same mosh-server process even if the client roams to a different
publicly-visible IP address or port.

You could imagine writing a very smart mosh proxy that has the keys to all
the sessions and can figure out (for an incoming datagram coming from an
unknown source IP:port) which session it actually belongs to, and then
makes a sticky mapping and routes it to the proper mosh-server. But I don't
think anybody has actually done this yet and of course there's a challenge
in making this reliable/replicated.

-Keith

Post by Thomas Buckley-Houston
Thanks so much for the clarification.

Post by Anders Kaseorg
UDP is connectionless

That's the key here. So I have no choice but to use sticky IP-based
routing. Round-robin DNS isn't an option I don't think, because I hope
one day to be able to scale to thousands of servers.
And thanks so much for the heads up about my DNSSEC records. I've sent
a request for them to be deleted. I'd added them and some SSHFP
records to explore automatically passing the StrictHostKey warning.
But it's not entirely straight forward. Even with correct DNS records
the SSH user still has to have VerifyHostKeyDNS enabled, which as I
understand most people don't. And then on top of that my DNS provider
(DNSSimple) automatically rotate the keys every 3 months, which means
I have to manually send a request to my registrars by email to update
the DNSSEC records. Is it all worth it do you think?

the

Post by Anders Kaseorg
UDP packets, because thatâs needed for MITâs athena.dialup.mit.edu pool.
If thatâs not an option and you really need all connections to go through
a single load balancer address, you could try wrapping mosh-server in a
script that passes different disjoint port ranges (-p) on different
backends, and forwarding those ranges to the corresponding backends from
the load balancer.
Unrelatedly, brow.sh doesnât resolve with DNSSEC-enabled resolvers like
1.1.1.1 or 8.8.8.8, seemingly due to some problem with the DS records set
with the registrar: https://dnssec-debugger.verisignlabs.com/brow.sh.
Anders

_______________________________________________
mosh-devel mailing list
http://mailman.mit.edu/mailman/listinfo/mosh-devel

Thomas Buckley-Houston

2018-06-29 07:17:09 UTC

Permalink

Hey Keith, John, everyone,

Yeah the more this is looking like a quite a big hurdle. Especially
your point Keith about roaming IPs (which I'd forgotten), it's a
central feature of Mosh I don't want to lose.

So the only 2 options seems to be exposing multiple IPs for Round
Robin (or other smart DNS routing) or writing a new Mosh proxy that
already has knowledge of the existing keys. Both seem like quite a
challenge. Round Robin DNS seems more approachable and I can imagine
integrating it with the Google Cloud DNS API I'm already using, but I
just wonder how expensive Google (or anyone for that matter) will make
thousands of static IP addresses? Apart from me having to learn Mosh
internals, one difficulty that strikes me about a Mosh proxy is that
it might introduce a non-trivial delay to each datagram arriving?
Though surely only ever in the order of a handful of milliseconds I
suppose.

Are there not any other identifying marks to a datagram, I don't know
much about low level networking, but maybe something like a MAC
address for example?

Thanks,
Tom

Post by Keith Winstein
Hi Thomas,
Glad you could provoke a very interesting discussion! But I'm still confused
-- how is "sticky IP-based routing" going to work after the client roams to
a new IP address (or to a new UDP source port)? When your system seems an
incoming UDP datagram from a previously unseen source IP:port, how does it
know which mosh-server (on which server machine) to send it to?
With off-the-shelf Mosh, you basically need a load-balancing strategy that
allows a destination IP:port to uniquely identify a particular mosh-server.
You can do this with multiple DNS A/AAAA records (where the client picks the
winning one -- maybe you permute the list), or with a smart DNS server that
serves *one* A or AAAA record to the client at the time of resolution (like
a CDN would use).
Instead of using the mosh wrapper script, you could have your users use some
other scheme to figure out the IP:port of the server, but the point is that
once you launch the mosh-client, it's going to keep sending datagrams to the
IP:port of the mosh-server, and those datagrams need to get to the same
mosh-server process even if the client roams to a different publicly-visible
IP address or port.
You could imagine writing a very smart mosh proxy that has the keys to all
the sessions and can figure out (for an incoming datagram coming from an
unknown source IP:port) which session it actually belongs to, and then makes
a sticky mapping and routes it to the proper mosh-server. But I don't think
anybody has actually done this yet and of course there's a challenge in
making this reliable/replicated.
-Keith

Post by Thomas Buckley-Houston
Thanks so much for the clarification.

Post by Anders Kaseorg
UDP is connectionless

That's the key here. So I have no choice but to use sticky IP-based
routing. Round-robin DNS isn't an option I don't think, because I hope
one day to be able to scale to thousands of servers.
And thanks so much for the heads up about my DNSSEC records. I've sent
a request for them to be deleted. I'd added them and some SSHFP
records to explore automatically passing the StrictHostKey warning.
But it's not entirely straight forward. Even with correct DNS records
the SSH user still has to have VerifyHostKeyDNS enabled, which as I
understand most people don't. And then on top of that my DNS provider
(DNSSimple) automatically rotate the keys every 3 months, which means
I have to manually send a request to my registrars by email to update
the DNSSEC records. Is it all worth it do you think?

_______________________________________________
mosh-devel mailing list
http://mailman.mit.edu/mailman/listinfo/mosh-devel

Keith Winstein

2018-07-01 04:54:34 UTC

Permalink

How about a semi-smart (but mostly Mosh-oblivious) server-side proxy/NAT
that works like this:

- The proxy service has one public IP address and like 65,000 available UDP
ports.
- The proxy service can itself be redundant with failover...
- When a user wants to open a new Mosh connection, they Mosh to a single
hostname (which resolves to the IP address of the proxy service).
- Your code allocates the necessary container, etc., and also allocates a
unique UDP port on the proxy.
- Your code runs the new mosh-server process in the target container.
- The proxy intercepts the mosh-server's "MOSH CONNECT <port> <key>"
message, replacing the port number with the unique public-facing UDP port
(and remembering the container's IP address and the original port number).
- When the proxy receives an incoming UDP datagram destined to a particular
UDP port, it forwards it to the appropriate container at its IP address and
at the original port number. It *preserves* the source IP and port of the
datagram when forwarding.
- When the container wants to send an outgoing UDP datagram, it sends it
normally (to whatever IP:port is associated with the client), except the
containers are not directly connected to the Internet; they use the
proxy/NAT as their next-hop router.
- For the outgoing UDP datagram, the proxy/NAT rewrites the container's
source IP:port to its own IP and the public port number.

I think this will allow you to serve like 65,000 separate mosh connections
from a single public IP address...

The added latency in forwarding a datagram is probably <1 ms, and you don't
really have to change anything about Mosh itself or its internals.

Unfortunately there are no unencrypted identifying marks to a Mosh
connection, except the incrementing sequence numbers (which start at 0 for
every connection).

-Keith

Post by Thomas Buckley-Houston
Hey Keith, John, everyone,
Yeah the more this is looking like a quite a big hurdle. Especially
your point Keith about roaming IPs (which I'd forgotten), it's a
central feature of Mosh I don't want to lose.
So the only 2 options seems to be exposing multiple IPs for Round
Robin (or other smart DNS routing) or writing a new Mosh proxy that
already has knowledge of the existing keys. Both seem like quite a
challenge. Round Robin DNS seems more approachable and I can imagine
integrating it with the Google Cloud DNS API I'm already using, but I
just wonder how expensive Google (or anyone for that matter) will make
thousands of static IP addresses? Apart from me having to learn Mosh
internals, one difficulty that strikes me about a Mosh proxy is that
it might introduce a non-trivial delay to each datagram arriving?
Though surely only ever in the order of a handful of milliseconds I
suppose.
Are there not any other identifying marks to a datagram, I don't know
much about low level networking, but maybe something like a MAC
address for example?
Thanks,
Tom

Post by Keith Winstein
Hi Thomas,
Glad you could provoke a very interesting discussion! But I'm still

confused

Post by Keith Winstein
-- how is "sticky IP-based routing" going to work after the client roams

Post by Keith Winstein
a new IP address (or to a new UDP source port)? When your system seems an
incoming UDP datagram from a previously unseen source IP:port, how does

Post by Keith Winstein
know which mosh-server (on which server machine) to send it to?
With off-the-shelf Mosh, you basically need a load-balancing strategy

that

Post by Keith Winstein
allows a destination IP:port to uniquely identify a particular

mosh-server.

Post by Keith Winstein
You can do this with multiple DNS A/AAAA records (where the client picks

the

Post by Keith Winstein
winning one -- maybe you permute the list), or with a smart DNS server

that

Post by Keith Winstein
serves *one* A or AAAA record to the client at the time of resolution

(like

Post by Keith Winstein
a CDN would use).
Instead of using the mosh wrapper script, you could have your users use

some

Post by Keith Winstein
other scheme to figure out the IP:port of the server, but the point is

that

Post by Keith Winstein
once you launch the mosh-client, it's going to keep sending datagrams to

the

Post by Keith Winstein
IP:port of the mosh-server, and those datagrams need to get to the same
mosh-server process even if the client roams to a different

publicly-visible

Post by Keith Winstein
IP address or port.
You could imagine writing a very smart mosh proxy that has the keys to

all

Post by Keith Winstein
the sessions and can figure out (for an incoming datagram coming from an
unknown source IP:port) which session it actually belongs to, and then

makes

Post by Keith Winstein
a sticky mapping and routes it to the proper mosh-server. But I don't

think

Post by Keith Winstein
anybody has actually done this yet and of course there's a challenge in
making this reliable/replicated.
-Keith

Post by Thomas Buckley-Houston
Thanks so much for the clarification.

Post by Anders Kaseorg
UDP is connectionless

That's the key here. So I have no choice but to use sticky IP-based
routing. Round-robin DNS isn't an option I don't think, because I hope
one day to be able to scale to thousands of servers.
And thanks so much for the heads up about my DNSSEC records. I've sent
a request for them to be deleted. I'd added them and some SSHFP
records to explore automatically passing the StrictHostKey warning.
But it's not entirely straight forward. Even with correct DNS records
the SSH user still has to have VerifyHostKeyDNS enabled, which as I
understand most people don't. And then on top of that my DNS provider
(DNSSimple) automatically rotate the keys every 3 months, which means
I have to manually send a request to my registrars by email to update
the DNSSEC records. Is it all worth it do you think?

Post by Anders Kaseorg
You may have a misunderstanding about how a Mosh session is set up.

The