Logging course: Introduction


basename	fold	split
cat	head	tail
comm	join	tac
cut	md5sum	tee
date	[]/test	tr
dirname	paste	touch
echo	pr	true
expand/unexpand	seq	uniq
false	sleep	wc
fmt	sort	...

Name	Use-cases
mutate	Rename/Normalize fields, change casing, remove fields
drop	Stop processing/forwarding of event
grok	Use regex to match events and extract field data
cipher	Pseudonomization of PII/credentials, data decoding
ruby	Whatever you can think of!

Welcome participants and wait for everyone to get settled. Introduction of the lecturers and their background. Segue: In this course we'll talk about logging...

Understand the run-time behavior and activity in modern IT systems.

Attacks or misuse. Fundamental for TD and IR.

Discover the benefits of centralized collection, normalization and analysis of log data.

Essential part to comply with most IT related laws/compliance schemes.

With the exception of the first lab, remote system provided by teacher. Use Vagrant or whatever you're comfortable with to setup a lab system.

- We'll cover lots of things in a short amount of time - In order to be able to do this we'll use scientifically proven methods to Make It Stick - Basically what the slide says - Don't forget to have fun! - If available, show detailed course schedule

- There are several resources to help you learn - Speaker notes in slides are heavily recommended for recaps/deep diving - May also be available through LMS, depending on how the course is consumed - The course is designed to be instructor lead, won't make the most of it on your own, see as aid - Presentations may be recorded, but only the speaker side for good and bad

The course wouldn't be available if it wasn't for financial support - Thanks!

- Encourage participants to make the course better - Learners are likely the best to provide critique, lecturers are likely a bit home-blind - No cats or dogs allowed! - Feel free to share it with friends or use it yourself later in your career

The term "log book" is old. Use as anectode: https://upload.wikimedia.org/wikipedia/commons/a/a8/Speyer_Handlog.jpg

- Confusing name, far from black! - Semi-automated system used to record what happened in/to the airplane. - Help us understand accidents and prevent future ones. Segue: We also use logging in computer systems...

- Review logs for IoCs and undesired activity. - Just the knowledge of that activity is monitored may deter undesired activity. - Help us understand how things actually work, why they don't and where to improve - Behavior of users/customers in our services - GDPR/PCI DSS requires logging of access to PII/credit card information

- Depends on what type of log we are talking about. Segue: Two broad categories...

- Why is the system inaccessible? - What is causing request latency? - Typically helps developers, system administrators and business analysists - A good operational log helps these people do their jobs

- Primarily interesting for security related roles. - Play detective with red strings and a "crazy wall" Segue: So what makes a good audit log entry?

- The 5 W:s of audit logging - Each log entry should ideally answer these questions.

- Essential for putting events in cronological order - In distributed systems, accurate time is crucial for correlation of events - More about time/clocks later!

- Useful context for events - Which user/system administrator could I ask/question about the happening - Is it resonable that a guy in sales is trying to access IT management systems? - How about a recently fired (disgruntled) employee who are trying to download all shared files? - Goes without saying, but the better the authentication the more we can trust this

- "Failed to authenticate against database due to wrong password" - "Could not delete file due to insufficient privileges" - "Safe-door unlocked"

- Event causer == human/computer - Help us put to make sense of the event - Is it resonable that Janne is trying to access their email from Murmansk? - Was the action performed from an IP address or computer controlled by the organization? - Can't always trust this information Segue: And lastly... why?

- Searches in the police data registry - Ticket ID/Documentation for why a firewall exception was added - May not be provided by a human, but rather another system to make sense of events - "This database entry was deleted due to user X performing action Y in system Z" Segue: Now that we know what should be in the audit log entry, how do we present the info?

- These logs will most likely be monitored by computers and analyzed by humans - Clear separation of individual events - Clear separation of the 5 W:s, should be easy to differentiate betwen when, what... - More about different log formats and their pros/cons late

- Some type of events are hard to categorize. - An application's permission failure to access a database may be of interest of both ops and sec - Often all logs are written to the same file/database table - A large part of the job in a SOC is filtering logs for relevant events

We know what we want, how do we actually get ahold of these logs?

- Sometimes known as "black box observability" (not to be confused with airplanes) - Useful for legacy systems who haven't been designed to produce desired logs Segue: Quite low-level, may be hard to answer W:s except when and where....

- The application - Prefered, but may be costly/very hard to implement - Requires cooperation from software/system developer

- As we've talked about audit logging, security personnel are a given consumer Segue: But there are also others who are interested...

- Let's be a bit more specific

- A/B testing == What effect did change X have on metrix Y? - Some businesses make their living on selling user behavior data to others

Are we reading each log event, row for row? No.

- Why is it so neat to have computers monitor the logs for us?

Example: Fail2Ban, automated order of disks based on total utilization

- Usually simple counters or gauges - Scraped and stored in a time-series database

Fail open or closed? Auditd is an example

https://upload.wikimedia.org/wikipedia/commons/e/ec/World_Time_Zones_Map.svg

https://www.netnod.se/sites/default/files/2022-06/NTS-FPGA-presentation-christer.pdf

https://en.wikipedia.org/wiki/List_of_GNU_Core_Utilities_commands

Examples of vertical access control: - Delete data (retention rules) - Modify detection/scrubbing rules

https://www.riksdagen.se/sv/dokument-och-lagar/dokument/svensk-forfattningssamling/lag-2022482-om-elektronisk-kommunikation_sfs-2022-482/

- Origins in 50s, use heavily in computers since late 60s

https://user-images.githubusercontent.com/20878432/43869313-29afa944-9b72-11e8-83fa-f8e8859875fc.png

https://go2docs.graylog.org/current/getting_in_log_data/gelf.html#GELFPayloadSpecification https://www.elastic.co/docs/reference/ecs https://www.elastic.co/docs/current/en/integrations/cef https://www.microfocus.com/documentation/arcsight/arcsight-smartconnectors-8.3/cef-implementation-standard/Content/CEF/Chapter%201%20What%20is%20CEF.htm

https://docs.redhat.com/en/documentation/red_hat_enterprise_linux/6/html/security_guide/sec-audit_record_types#sec-Audit_Record_Types

https://sysdig.com/blog/getting-started-writing-falco-rules/ https://falcosecurity.github.io/rules/ https://cilium.io/

https://docs.opensearch.org/latest/search-plugins/sql/ppl/index/

https://docs.opensearch.org/latest/search-plugins/sql/sql/index/

https://opensearch.org/blog/semantic-search-solutions/

https://opensearch.org/docs/latest/search-plugins/conversational-search/

https://fluentbit.io/

https://www.elastic.co/docs/reference/beats/libbeat/community-beats

https://sigmahq.io/

Logging course

Welcome and thanks for joining!

What we will cover

Requires basic knowledge of...

How we will do it

Acknowledgements

Free as in beer and speech

Let us dig in!

Vocabulary and basics

Why do we log?

Operational logs

Audit logs

When?

Who?

What?

(from) Where?

Why?

"Inspection-based"

Instrumented

Development and operations

Security personnel

Data analysts and scientists

"Pattern-based"

Aggregation and correlation

"Anomaly-based"

Normalization

Enrichment

Visualization

Local analysis

Centralized analysis

Ease correlation

Minimizes risk of tampering

Optimize performance and cost

Alerting

Anomaly detection

Logging overhead

Storage and processing costs

Collection of sensitive data

Legal/Compliance challenges

Cost of analysis

Group exercise

Putting knowledge to use

Exercise: Sell em' logging

Org. 1: Xample Bank & Finance

Org. 2: Examplezon Inc.

Org. 3: Exemplum Medical

Org. 4: Examplx Web Services

Reflections exercise

What have we learned so far?

Answer the following questions

Basics recap

Let's refresh our memory

Logging helps us...

Operational logs

Audit logs

Benefits of centralized logging

Beware of...

Time and clocks

A not so scary introduction

Keeping it simple

What is a second anyway?

Time zones

Daylight savings

tz database

Time/Date representation

The two challenges

NTP

Example clients/servers

Weaknesses

NTS

PTP

What's the correct time?

Getting reference time

Using pool.ntp.org

Using ntp.se

Network traffic logging

Resource intensive

Prevalence of encryption

NIDS

IPS