[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [hobbit] Future of Hobbit

To: hobbit (at) hswn.dk
Subject: Re: [hobbit] Future of Hobbit
From: Charles Jones <jonescr (at) cisco.com>
Date: Fri, 25 Jan 2008 12:43:41 -0700
Authentication-results: sj-dkim-3; header.From=jonescr (at) cisco.com; dkim=pass ( sig from cisco.com/sjdkim3002 verified; );
Dkim-signature: v=1; a=rsa-sha256; q=dns/txt; l=3236; t=1201290414; x=1202154414; c=relaxed/simple; s=sjdkim3002; h=Content-Type:From:Subject:Content-Transfer-Encoding:MIME-Version; d=cisco.com; i=jonescr (at) cisco.com; z=From:=20Charles=20Jones=20<jonescr (at) cisco.com> |Subject:=20Re=3A=20[hobbit]=20Future=20of=20Hobbit |Sender:=20; bh=QkHZGwvACinrzSDCNLngxIroO7jwi/HvByb6IGMb15g=; b=Hw5icSDntu4ZhBvd7ddGxLQpClRBCCEz+AyaT3c/XSpUOy7byyQANktJDi QehFZaZdW/dQ0WlSeKqPUU987vK5l2VlfEA+L1Llgu7eolKbUHLQ/BLrrAnn rwfyFtAhj/;
References: <C3BF7B35.16E1%tim.rotunda (at) twcable.com>
User-agent: Thunderbird 2.0.0.9 (X11/20071115)

I think Henriks stance on having the server collect data via sshconnections just doesn't scale. Sure it works fine for a few dozenhosts, but let's say you have 2000 servers...now you are expecting beable to make 2000 trouble-free ssh connections before the next pollingcycle begins. This introduces many problems:

* How many ssh sessions can you run at the same time without spiking theload on the hobbit server?* What happens when an ssh session hangs (could hang the hobbit server,or make the poll cycle take too long)

You do know about the "pulldata" option? It allows the Hobbit server todo a "pull" instead of waiting for client "push". This works fairlywell, and I am using it in a production environment. I can see how itwould not scale to well either though, for a really large number of hosts.

To picture the scalability, imagine a server that only has to receiveupdates from hobbit clients. All it has to do is listen on port 1984,and using relatively little CPU it can probably handle a constant flowof client updates.

Now imagine a server that has to go and fetch the client data itself.There is a LOT more overhead and processing involved in launching anoutgoing ssh connection, running a remote client data-gathering command,waiting for the output, etc. Imagine 2000 of those firing off every 5minutes. How many simultaneous ssh sessions can your server handle?I've seen a server brought to its knees by a script that ran amok andwas doing 50 simulataneous scp commands :) Some time saving is done byusing msgcache (no waiting for the data-gathering), but there is stillthe overhead of ssh itself, and having key-based ssh ability could bedeemed a security risk (anyone who hacks into the hobbit server couldthen ssh to all of your client machines without a password).

A good solution would be an ssl-encrypted, bi-directional protocol. Thiswould allow secure transfer of client data, either push or pull, withoutthe overhead, management, and security risks of using ssh.

In the meantime, definitely check out the pulldata+msgcache option, asit sounds like it will do what you want.


-Charles

Tim Rotunda wrote:

To answer Axel's what is it question.....its a Hobbit version of BB-Central,
which runs on a central server like hobbit does.  It reaches out to the
clients via ssh (or whatever) and collects data.  I did a shell script
version a few years ago and it worked good until the client count topped
25-30.  Then I migrated it to C and it would handle 60+ nodes pretty well.
Then I migrated that to a multi-threaded C process and it really smoked.  I
never did reach the limit with that version.  I think they are still using
it and adding nodes to the client list, which is prob over 250 or so.

I was going to put it out to the community but my company would not allow it
(idiots) so I couldn't.  I now work only 40 hours a week so now I have some
time to myself and was thinking about rewriting it from memory and putting
it out there.  I would put out the one that is threaded and it would prob
just be for x86 Linux, which should build on Solaris, HP-UX, etc.

Follow-Ups:
- Re: [hobbit] Future of Hobbit
  - From: Tim Rotunda
- Re: [hobbit] Future of Hobbit
  - From: Hobbit User in Richmond
- Re: [hobbit] Future of Hobbit
  - From: Josh Luthman

References:
- Re: [hobbit] Future of Hobbit
  - From: Tim Rotunda

Prev by Date: disabling monitoring selectively
Next by Date: Re: [hobbit] Future of Hobbit
Previous by thread: Re: [hobbit] Future of Hobbit
Next by thread: Re: [hobbit] Future of Hobbit
Index(es):
- Date
- Thread