
This project forked from manuelkasper/as-stats


A simple tool to generate per-AS traffic graphs from NetFlow/sFlow records

License: BSD 2-Clause "Simplified" License


AS-Stats v1.5 (2014-01-29)

A simple tool to generate per-AS traffic graphs from NetFlow/sFlow records
by Manuel Kasper for Monzoon Networks AG

How it works

A Perl script (asstatd.pl) collects NetFlow v8/v9 AS aggregation records or sFlow v5 samples from one or more routers. It caches them for about a minute (to prevent excessive writes to RRD files), identifies the link that each record refers to (by means of the SNMP in/out interface index), maps it to a corresponding "known link" and RRD data source, and then runs RRDtool. To avoid losing new records while the RRD files are updated, the update task is run in a separate process.

For each AS, a separate RRD file is created as needed. It contains two data sources for each link - one for inbound and one for outbound traffic. In generated per-AS traffic graphs, inbound traffic is shown as positive, while outbound traffic is shown as negative values.

Another Perl script, rrd-extractstats.pl, is meant to run about once per hour. It sums up per-AS and link traffic during the last 24 hours, sorts the ASes by total traffic (descending) and writes the results to a text file. This is then used to display the "top N AS" and other stats by the provided PHP scripts.
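
The "sum, then sort descending" step can be pictured with a small shell sketch. This is not how rrd-extractstats.pl is implemented (the real script reads the RRD files directly). The input format (one line per AS, with tab-separated AS number, inbound bytes and outbound bytes) and the function name top_as are assumptions made purely for illustration.

```shell
# Hypothetical input on stdin: "ASN<TAB>bytes_in<TAB>bytes_out", one AS per line.
# Output: the N ASes with the highest combined traffic, largest first.
top_as() {
    awk -F '\t' '{ printf "%s\t%.0f\n", $1, $2 + $3 }' |
        sort -t "$(printf '\t')" -k2,2nr |
        head -n "$1"
}
```

Piping such a file into `top_as 20` would yield the kind of ranking that a "top 20 AS" page needs.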

Prerequisites

  • Perl 5.8
  • RRDtool 1.2 (with Perl "RRDs" library)
  • if using sFlow: the Net::sFlow module (CPAN)
  • web server with PHP 5
  • one or more routers that can generate NetFlow v8/v9 AS aggregation records or sFlow samples

Installation

  • Copy the perl scripts asstatd.pl and rrd-extractstats.pl to the machine that will collect NetFlow/sFlow records

  • Create a "known links" file with the following information about each link that you want to appear in your AS stats:

    • IP address of router (= source IP of NetFlow datagrams)
    • SNMP interface index of interface (use "show snmp mib ifmib ifindex" to find out)
    • a short "tag" (12 chars max., a-z A-Z 0-9 _ only) that will be used internally (e.g. for RRD DS names)
    • a human-readable description (will appear in the generated graphs)
    • a color code for the graphs (HTML style, 6 hex digits)
    • the sampling rate (or 1 if you're not using sampling on the router)

    See the example file provided (knownlinks) for the format.
    Important: you must use tabs, not spaces, to separate fields!
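
    Since a stray space instead of a tab is easy to miss, a quick sanity check of each line can help. The sketch below assumes that the fields appear in the order listed above and that the color is written without a leading "#"; the provided example file remains the authoritative reference for the format. The function name check_knownlinks_line is hypothetical.

    ```shell
    # Print "ok" if a line has the six tab-separated fields described above,
    # otherwise print the first problem found.
    # Assumed field order: router IP, ifindex, tag, description, color, sampling rate.
    check_knownlinks_line() {
        printf '%s\n' "$1" | awk -F '\t' '
            NF != 6                                    { print "expected 6 tab-separated fields"; exit }
            $2 !~ /^[0-9]+$/                           { print "ifindex must be numeric"; exit }
            length($3) > 12 || $3 !~ /^[A-Za-z0-9_]+$/ { print "bad tag"; exit }
            length($5) != 6 || $5 !~ /^[0-9A-Fa-f]+$/  { print "bad color"; exit }
            $6 !~ /^[0-9]+$/                           { print "sampling rate must be numeric"; exit }
            { print "ok" }
        '
    }
    ```

    A line separated by spaces rather than tabs is reported as "expected 6 tab-separated fields", which is the most common mistake this catches.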

  • Create a directory to hold per-AS RRD files. For each AS, about 128 KB of storage is required, and there could be (in theory) up to 64511 ASes. AS-Stats automatically creates 256 subdirectories in this directory for more efficient storage of RRD files (one directory per lower byte of AS number, in hex).
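
    To make the subdirectory scheme and the storage figure concrete, here is a minimal sketch. It assumes two-digit lowercase hex directory names, which the description above does not spell out, and both function names are hypothetical.

    ```shell
    # Subdirectory for an AS number: its lower byte, printed as two hex digits.
    # (Two-digit lowercase formatting is an assumption.)
    rrd_subdir() {
        printf '%02x\n' $(( $1 % 256 ))
    }

    # Rough worst-case disk usage in MiB for a given number of ASes,
    # using the figure of about 128 KB per AS quoted above.
    rrd_worst_case_mib() {
        echo $(( $1 * 128 / 1024 ))
    }
    ```

    For example, `rrd_subdir 3303` prints `e7`, and `rrd_worst_case_mib 64511` prints `8063`, i.e. a little under 8 GiB if every possible AS were seen.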

  • Start asstatd.pl in the background (or, better yet, write a startup script for your operating system to automatically start asstatd.pl on boot):

    nohup asstatd.pl -r /path/to/rrd/dir -k /path/to/knownlinks &

    By default, asstatd.pl will listen on port 9000 (UDP) for NetFlow datagrams, and on port 6343 (UDP) for sFlow datagrams. Use the -p/-P options if you want to change that (use 0 as the port number to disable either protocol). For sFlow, you also need to specify your own AS number with the -a option for accurate classification of inbound and outbound traffic. It's a good idea to make sure only UDP datagrams from your trusted routers will reach the machine running asstatd.pl (firewall etc.).

  • NetFlow only: Have your router(s) send NetFlow v8 or v9 AS aggregation records to your machine. This is typically done with commands like the following (Cisco IOS):

      ip flow-cache timeout active 5
    
      int Gi0/x.y
        ip flow ingress
    
      ip flow-export source <source interface>
      ip flow-export version 5 origin-as
      ip flow-aggregation cache as
       cache timeout active 5
       cache entries 16384
       export destination <IP address of server running AS stats> 9000
       enabled
    

    Adjust the number of cache entries if necessary (i.e. if you get messages like "Netflow as aggregation cache is almost full" in the logs).

    Note that the version has to be specified as 5, even though the AS aggregation records will actually be v8. Also, setting the global flow cache timeout to 5 minutes is necessary to get "smooth" traffic graphs (default is 30 minutes), as a flow is only counted when it expires from the cache. Decreasing the flow-cache timeout may result in a slight increase in CPU usage (and NetFlow AS aggregation takes its fair share of CPU as well, of course).

    Routers with MLS (Multi-Layer Switching, e.g. Cisco 7600 series) require additional commands like the following in order to enable NetFlow processing/aggregation for packets processed in hardware:

      mls aging fast time 4 threshold 2
      mls aging long 128
      mls aging normal 64
      mls flow ip interface-full
    

    For IOS XR, the configuration looks as follows:

      flow exporter-map FEM
       version v9
       !
       transport udp 9000
       source <source interface>
       destination <IP address of server running AS stats> vrf default
    
      flow monitor-map IPV4-FMM
       record ipv4
       exporter FEM
       cache entries 16384
       cache timeout active 300
      !
      flow monitor-map IPV6-FMM
       record ipv6
       exporter FEM
       cache entries 16384
       cache timeout active 300
      !
    
      sampler-map SM
       random 1 out-of 10000
    
      router bgp 100
        address-family ipv4 unicast
         bgp attribute-download
        address-family ipv6 unicast
         bgp attribute-download
    

    For JunOS, the configuration looks as follows:

      forwarding-options {
      	sampling {
      		input {
      			rate 2048;
      			max-packets-per-second 4096;
      		}
      		family inet {
      			output {
      				flow-active-timeout 60;
      				flow-server x.x.x.x {
      					port 9000;
      					autonomous-system-type origin;
      					aggregation {
      						autonomous-system;
      					}
      					version 8;
      				}
      			}
      		}
      	}
      }
    

    JunOS IPFIX configuration:

      chassis {
      	tfeb {
      		slot 0 {
      			sampling-instance flow-ipfix;
      		}
      	}
      }
      interfaces {
      	ge-1/0/0 {
      		unit 0 {
      			family inet {
      				sampling {
      					input;
      					output;
      				}
      			}
      		}
      	}
      }
      forwarding-options {
      	sampling {
      		instance {
      			flow-ipfix {
      				input {
      					rate 1;
      				}
      				family inet {
      					output {
      						flow-server 192.0.2.10 {
      							port 9000;
      							autonomous-system-type origin;
      							no-local-dump;
      							version-ipfix {
      								template {
      									ipv4;
      								}
      							}
      						}
      						inline-jflow {
      							source-address 192.0.2.1;
      						}
      					}
      				}
      			}
      		}
      	}
      }
      services {
      	flow-monitoring {
      		version-ipfix {
      			template ipv4 {
      				flow-active-timeout 60;
      				flow-inactive-timeout 60;
      				template-refresh-rate {
      					packets 1000;
      					seconds 10;
      				}
      				option-refresh-rate {
      					packets 1000;
      					seconds 10;
      				}
      				ipv4-template;
      			}
      		}
      	}
      }
    
  • sFlow only: Have your router(s) send sFlow samples to your machine. Your routers may need a software upgrade to make them include AS path information for both inbound and outbound packets (this is a good thing to check if your graphs only show traffic in one direction).

  • Wait 1-2 minutes. You should then see new RRD files popping up in the directory that you defined/created earlier on. If not, make sure that asstatd.pl is running, not spewing out any error messages, and that the NetFlow/sFlow datagrams are actually reaching your machine (tcpdump...).
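
    A quick way to check whether RRD files are appearing is to count them. The helper below is a sketch with a hypothetical name; it assumes the files carry an .rrd extension and live in the subdirectories of the RRD directory.

    ```shell
    # Count the RRD files found anywhere below the given directory.
    count_rrd_files() {
        find "$1" -type f -name '*.rrd' | wc -l | tr -d ' '
    }

    # If the count stays at 0, watch for incoming datagrams on the default
    # NetFlow port (run manually, usually as root; the interface is an assumption):
    #   tcpdump -n -i any udp port 9000
    ```

    Running `count_rrd_files /path/to/rrd/dir` a couple of minutes apart should show the number growing while flows are arriving.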

  • Add a cronjob to run the following command every hour:

    rrd-extractstats.pl /path/to/rrd/dir /path/to/knownlinks /path/to/asstats_day.txt

    That script will go through all RRD files and collect per-link summary stats for each AS, sort them by total traffic (descending), and write them to a text file. The "top N AS" page uses this to determine which ASes to show.

  • Copy the contents of the "www" directory to somewhere within your web server's document root and change file paths in config.inc as necessary.

  • Make the directory "asset" within www writable by the web server (this is used to cache AS-SETs and avoid having to query whois for every request).

  • Wait a few hours for data to accumulate. :)

  • Access the provided PHP scripts via your web server and marvel at the (hopefully) beautiful graphs.

Adding a new link

Adding a new link involves adding two new data sources to all RRD files. This is a bit of a PITA since RRDtool itself doesn't provide a command to do that. A simple (but slow) Perl script that is meant to be used with RRDtool's XML dump/restore feature is provided (add_ds_proc.pl, add_ds.sh). Note that asstatd.pl should be stopped while modifying RRD files, to avoid breaking them with concurrent modifications.

Changing the RRAs

By default, the created RRDs keep data as follows:

  • 48 hours at 5 minute resolution
  • 1 week at 1 hour resolution
  • 1 month at 4 hour resolution
  • 1 year at 1 day resolution

If you want to change that, modify the getrrdfile() function in asstatd.pl and delete any old RRD files.
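
When picking new values, it helps to translate "retention at resolution" into the steps and rows of an RRDtool RRA definition (RRA:CF:xff:steps:rows). The sketch below assumes a base step of 300 seconds (inferred from the 5 minute resolution above) and uses AVERAGE with an xff of 0.5 as placeholders; check getrrdfile() for the values AS-Stats actually uses. The function name rra_spec is hypothetical.

```shell
# Print an RRA definition for a given resolution and retention, both in seconds.
# Assumptions: base step of 300 s, AVERAGE consolidation, xff of 0.5.
rra_spec() {
    step=300
    resolution=$1
    retention=$2
    echo "RRA:AVERAGE:0.5:$(( resolution / step )):$(( retention / resolution ))"
}
```

With a month taken as 30 days, the defaults listed above work out as follows: `rra_spec 300 172800` prints `RRA:AVERAGE:0.5:1:576`, `rra_spec 3600 604800` prints `RRA:AVERAGE:0.5:12:168`, and `rra_spec 14400 2592000` prints `RRA:AVERAGE:0.5:48:180`.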

Support

A mailing list is available at https://groups.google.com/d/forum/as-stats-users. Please do not send requests for help/support directly to the author.

To do

  • rrd-extractstats.pl uses a lot of memory and could probably use some optimization.
