Skip to content

can extract aws loadbalancer access logs into elastic (in logstash format)

Notifications You must be signed in to change notification settings

doddle/aws-accesslog-parser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Quickstart

  1. sync the logs u want locally

First download the logs you want.. EG:

export YEAR=2020
export MONTH=03
export DAY=06
export HOUR=16
export S3BUCKET=my-alb-log-bucket
export AWS_ACCOUNT_ID=123321321321
export REGION=us-east-1
export ALB_NAME=ingress
export SOMETHING=alb-foo
mkdir -p ./logs/${YEAR}/${MONTH}/${DAY}/
aws s3 sync \
  s3://${S3BUCKET}/${SOMETHING}/AWSLogs/${AWS_ACCOUNT_ID}/elasticloadbalancing/${REGION}/${YEAR}/${MONTH}/${DAY}/ \
  ./logs/${YEAR}/${MONTH}/${DAY}/ \
  --exclude="*" \
  --include "*${ALB_NAME}*${YEAR}${MONTH}${DAY}T${HOUR}*"
  1. docker-compose up

  2. parse logs into elastic

find ./logs/${YEAR}/${MONTH}/${DAY} -type f -name "*.gz" -exec ./main.py -f {} \;

setup your dev env

elastic+kibana

  1. get es + kibana data dir ready.. we need it owned by 1000:0
mkdir data-elasticsearch
sudo chmod g+rwx data-elasticsearch
sudo chown -R 1000:0 data-elasticsearch
  1. docker-compose up

  2. disable kibana telemetry

once its up.. u can disable telemetry like so:

curl -v http://localhost:5601/api/telemetry/v2/optIn \
  -H "kbn-version: 7.6.0" \
  -H "Accept: application/json, text/plain, */*" \
  -H "Content-Type: application/json;charset=utf-8" \
  --data '{"enabled":false}'

python venv (optional)

  1. create a python venv in localenv dir
python3 -m venv localvenv
  1. activate the venv

(or use direnv allow)

source localvenv/bin/activate
  1. install the deps
pip install --upgrade pip
pip install -r requirements.txt
  1. write code

  2. don't forget to freeze deps

pip freeze --path ./localvenv/lib/python3.8/site-packages > requirements.txt

Credits

The fastest way I've found to parse the lines is with the regex by @jweyrich here

About

can extract aws loadbalancer access logs into elastic (in logstash format)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%