Bandwidth accounting with ulogd

My post about repositories wasn't just a little attempt to stave off work, it was part of a larger scheme.

I share the ADSL line in my digs with 3 other people. We do split-routing to save money, but we still have to divide the phone bill at the end of the month. Rather than buy a fixed cap, and have a fight over who's fault it was when we get capped, we are running a pay-per-use system (with local use free, subsidised by me). It means you don't have to restrain yourself for the common cap, but it also means I need to calculate who owes what.

For the first month, I used my old standby, bandwidthd. It uses pcap to count traffic, and gives you totals and graphs. For simplicity of logging, I gave each person a /28 for their machines and configured static DHCP leases. Then bandwidthd totalled up the internet use for each /28.

This was sub-optimal. bandwidthd either sees the local network, in which case it can't see which packets went out over which link. Or it can watch the international link, but then not know which user is responsible.

I could have installed some netflow utilities at this point, but I wanted to roll my own with the correct Linux approach (ulog) rather than any pcapping. ulogd is the easy ulog solution.

Ulogd can pick up packets that you "-j ULOG" from iptables. It receives them over a netlink interface. You can tell iptables how many bytes of each packet to send, and how many to queue up before sending them. E.g.

# iptables -I INPUT 1 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 48 --ulog-prefix input

will log the first 48 bytes of any incoming packet to netlink-group 1. It will tag the packet as being "input", and send them in batches of 50. 48 bytes is usually enough to catch any data you could want from the headers. If you were only need size, 4 bytes will do, and for source and destination as well, 20.

Now, we tell ulogd to listen for this stuff and log it. Ulogd has a pluggable architecture. IPv4 decoding is a plugin, and there are various logging plugins for "-j LOG" emulation, Text files, pcap-files, MySQL, PostgreSQL, and SQLite. For my purposes, I used MySQL as the router in question already had MySQL on it (for Cacti). Otherwise, I would have opted for SQLite. Be warned that the etch version of ulogd doesn't automatically reconnect to the MySQL server should the connection break for any reason. I backported the lenny version to etch to get around that. (You also need to provide the reconnect and connect_timeout options.)

Besides the reconnection issue, the SQL implementations are quite nice. They have a set schema, and you just need to create a table with the columns in it that you are interested in. No other configuration (beyond connection details) is necessary.

My MySQL table:

51&q=CREATE&lr=lang_en">CREATE 51&q=TABLE&lr=lang_en">TABLE `ulog` (

  `id` 51&q=INT&lr=lang_en">int(10) 51&q=UNSIGNED&lr=lang_en">unsigned 5.1/en/non-typed-operators.html">NOT 51&q=NULL&lr=lang_en">NULL 51&q=AUTO_INCREMENT&lr=lang_en">auto_increment,

  `oob_time_sec` 51&q=INT&lr=lang_en">int(10) 51&q=UNSIGNED&lr=lang_en">unsigned 5.1/en/non-typed-operators.html">NOT 51&q=NULL&lr=lang_en">NULL,

  `oob_prefix` 5.1/en/string-functions.html">char(4) 5.1/en/non-typed-operators.html">NOT 51&q=NULL&lr=lang_en">NULL,

  `ip_totlen` 51&q=SMALLINT&lr=lang_en">smallint(5) 51&q=UNSIGNED&lr=lang_en">unsigned 5.1/en/non-typed-operators.html">NOT 51&q=NULL&lr=lang_en">NULL,

  51&q=PRIMARY%20KEY&lr=lang_en">PRIMARY KEY  (`id`),

  51&q=UNIQUE&lr=lang_en">UNIQUE 51&q=KEY&lr=lang_en">KEY `id` (`id`),

  51&q=KEY&lr=lang_en">KEY `oob_prefix` (`oob_prefix`),

  51&q=KEY&lr=lang_en">KEY `oob_time_sec` (`oob_time_sec`)

);

My ulogd.conf:

[global]

# netlink multicast group (the same as the iptables --ulog-nlgroup param)

nlgroup=1    

# logfile for status messages

logfile="/var/log/ulog/ulogd.log"    

# loglevel: debug(1), info(3), notice(5), error(7) or fatal(8)

loglevel=5    

# socket receive buffer size (should be at least the size of the

# in-kernel buffer (ipt_ULOG.o 'nlbufsiz' parameter)

rmem=131071    

# libipulog/ulogd receive buffer size, should be > rmem

bufsize=150000

# ulogd_BASE.so - interpreter plugin for basic IPv4 header fields

#             you will always need this

plugin="/usr/lib/ulogd/ulogd_BASE.so"

plugin="/usr/lib/ulogd/ulogd_MYSQL.so"

[MYSQL]

table="ulog"

pass="foo"

user="ulog"

db="ulog"

host="localhost"

reconnect=5

connect_timeout=10

The relevant parts of my firewall rules:

# Count proxy usage (transparent and explicit)

iptables -A count-from-inside -p ! tcp -j RETURN

iptables -A count-from-inside -p tcp -m multiport --destination-ports ! 3128,8080 -j RETURN

iptables -A count-from-inside -s 10.0.0.16/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix sr-p

iptables -A count-from-inside -s 10.0.0.32/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix fb-p

iptables -A count-from-inside -s 10.0.0.128/25 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix gu-p

iptables -A count-to-inside -p ! tcp -j RETURN

iptables -A count-to-inside -p tcp -m multiport --source-ports ! 3128,8080 -j RETURN

iptables -A count-to-inside -d 10.0.0.16/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix sr-p

iptables -A count-to-inside -d 10.0.0.32/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix fb-p

iptables -A count-to-inside -d 10.0.0.128/25 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix gu-p

# Count forwarded traffic (excluding local internet connection - ppp2)

iptables -A count-forward-in -i ppp2 -j RETURN

iptables -A count-forward-in -d 10.0.0.16/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix sr-f

iptables -A count-forward-in -d 10.0.0.32/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix fb-f

iptables -A count-forward-in -d 10.0.0.128/25 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix gu-f

iptables -A count-forward-out -o ppp2 -j RETURN

iptables -A count-forward-out -s 10.0.0.16/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix sr-f

iptables -A count-forward-out -s 10.0.0.32/28 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix fb-f

iptables -A count-forward-out -s 10.0.0.128/25 -j ULOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix gu-f

# Glue

iptables -A INPUT -i eth0 -j count-from-inside

iptables -A OUTPUT  -o eth0 -j count-to-inside

iptables -A FORWARD -i ppp+ -j count-forward-in

iptables -A FORWARD -o ppp+ -j count-forward-out

So, traffic for my /28 (sr) will be counted as sr-f or sr-p so I can tally up proxy & forwarded traffic separately. (Yes, I can count traffic with squid too, but doing it all in one place is simpler.) fb is random housemate Foo Bar, and gu guest (unreserved IP addresses).

You can query the usage this month with for example:

51&q=SELECT&lr=lang_en">SELECT oob_prefix, 5.1/en/group-by-functions-and-modifiers.html">SUM(ip_totlen) 51&q=FROM&lr=lang_en">FROM ulog 51&q=WHERE&lr=lang_en">WHERE oob_time_sec > 5.1/en/date-and-time-functions.html">UNIX_TIMESTAMP('2008-04-01 00:00:00') 51&q=GROUP%20BY&lr=lang_en">GROUP BY oob_prefix;

Your table will fill up fast. We are averaging around 200 000 rows per day. So obviously some aggregation is in order:

And every night, run something like:

51&q=INSERT&lr=lang_en">INSERT 51&q=INTO&lr=lang_en">INTO daily (51&q=TIME&lr=lang_en">time, oob_prefix, 51&q=DATA&lr=lang_en">data)

51&q=SELECT&lr=lang_en">SELECT 5.1/en/date-and-time-functions.html">FROM_UNIXTIME(5.1/en/group-by-functions-and-modifiers.html">MAX(oob_time_sec)), oob_prefix, 5.1/en/group-by-functions-and-modifiers.html">SUM(ip_totlen)

51&q=FROM&lr=lang_en">FROM ulog

51&q=WHERE&lr=lang_en">WHERE oob_time_sec >= 5.1/en/date-and-time-functions.html">UNIX_TIMESTAMP('2008-04-01 00:00:00')

  5.1/en/non-typed-operators.html">AND oob_time_sec < 5.1/en/date-and-time-functions.html">UNIX_TIMESTAMP('2008-04-02 00:00:00')

51&q=GROUP%20BY&lr=lang_en">GROUP BY oob_prefix;

51&q=DELETE&lr=lang_en">DELETE 51&q=FROM&lr=lang_en">FROM ulog 51&q=WHERE&lr=lang_en">WHERE oob_time_sec  >= 5.1/en/date-and-time-functions.html">UNIX_TIMESTAMP('2008-04-01 00:00:00')

  5.1/en/non-typed-operators.html">AND oob_time_sec < 5.1/en/date-and-time-functions.html">UNIX_TIMESTAMP('2008-04-02 00:00:00');

Finally, I have a simple little PHP script that provides reporting and calculates dues. Done.

Comments

Mon, 07/04/2008 - 5:50pm — Dominic White (not verified)

Nice, but any ideas on username logging

This looks great. I am in a similar situation, but have a shared host sitting at rackspace which people often tunnell (ssh -D) through. I would like to do per username logging. Any ideas on where I should look (I’ve done some searching) or how/

Mon, 07/04/2008 - 10:01pm — tumbleweed

IPTables mod_owner

Match based on uid. Naturally you’ll only be able to log traffic from the machine itself, not traffic to it (owned by sshd). But if users are only tunnelling, then just double any figure this produces.

It’s a little tricky, because the “owner” module only works in OUTPUT. So we have to use some connmark foo e.g.

iptables -A OUTPUT -m owner --uid-owner 1000 -j CONNMARK --set-mark 0x1
iptables -A OUTPUT -m connmark --mark 0x1 -j LOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix uid1000
iptables -A INPUT -m connmark --mark 0x1 -j LOG --ulog-nlgroup 1 --ulog-qthreshold 50 --ulog-cprange 4 --ulog-prefix uid1000

Wed, 25/06/2008 - 1:13pm — falde (not verified)

per-user logging.

This requires a unique setting per user. That could be quite messy in a environment where there are thousands of users…

Wed, 02/07/2008 - 10:47pm — tumbleweed

Yup

And I can’t think of any workaround for that

Thu, 10/04/2008 - 12:11am — Jason (not verified)

I shared a dedicated server

I shared a dedicated server with four other people and started on this exact same thing a few years ago. I got sidetracked and never finished it up. Today there was some talk of one or more of the people pulling out so I’m no looking at some other hosting services. Of course, the key thing is knowing how much bandwidth I’m using. So, I started back on my long abandoned ulogd project and fortunately I came across your post which has been a great help in jogging my memory! One question though, I remember somebody saying that ip_totlen’s units were 32 bit words (not bytes or kb or bits). Do you know if that’s correct?

Thu, 10/04/2008 - 11:54am — tumbleweed

Hmm

I haven’t tested, but I’ve noticed almost a perfect corrolation between ip_totlen and my ISP’s bandwidth records. So I’m pretty sure it measures octets (bytes).

BTW, if you want a simple interface-wide bandwidth accounting solution, vnstat is awesome.

Mon, 15/09/2008 - 4:32am — Anonymous (not verified)

why don’t you use

why don’t you use chillispot ? I use it for accounting purpose and it now server 200+ user daily. :D good money do come easy…

Tue, 16/09/2008 - 12:40pm — tumbleweed

Do it myself

I always prefer to do such things myself. I learn more that way.

Captive portals are also out of the question. I don’t want to have to deal with those unless I’m at an airport.

Sat, 21/11/2009 - 3:47am — gfa (not verified)

Navigation