There are different approaches when it comes to logging: to log every input as rawly as possible or to clean up log events and user inputs before saving them. There are pros and cons for both approaches. Whichever path you choose, it is important to remember your choice when analysing these log events. Just.. not to get any surprises on the way. I’ll try to illustrate my point with the following AWS S3 Server Access Log example. Although I’m bringing an example based on S3, please keep in mind that there are other application servers with similar “logging features”. So make sure you have a good overview about how your systems deal with event logging. Prolog I created my S3 bucket and enabled the server access logging. You can refer to . Amazon’s tutorial To analyse the S3 access logs, I downloaded the log files to my local repository and used GNU command-line tools to analyse the events. For fetching the log files I’ve written a that searches for the new log files and and downloads them through . ruby script AWS S3 REST API I’ve uploaded two files to the bucket: the is a publicly accessible document, is a document that is not shared with everyone. This is important to understand the examples later on. public.txt semi.txt Please keep in mind that this is only a demonstration, not a fool-proof attack vector. Log events The is quite similar to the Apache web servers access log. One of the important differences how ever is that ! No validation of data nor escaping non-printable symbols is being done before saving the events to log files. It is not a bug but it is an important aspect to remember when analysing log files later on or when choosing your tools for log analysis. If your logs contain unescaped raw data then your analytical tools have to be ready to deal with malicious content or attacks towards logging. AWS S3 Server Access Logs format AWS S3 server access logs are saved as raw data Wait.. what’s the fuss about? Let me explain by bringing some examples. When requesting a file from your AWS S3 bucket, an HTTP GET request is sent to the AWS, the content of the file will be returned and event will be logged to the server access logs. GET / /public.txt HTTP/1.1Host: s3.eu-central-1.amazonaws.comUser-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 TESTIME Firefox/61.0 <bucket_name> Log event that would be written is as follows: [30/Jul/2018:17:16:55 +0000] 328.496.13.534 — REST.GET.OBJECT public.txt "GET / /public.txt HTTP/1.1" 200 – 19 19 7 7 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 TESTIME Firefox/61.0" - <bucket_owner> <bucket_name> <request_id> <bucket_name> So far so good. But as I mentioned, in AWS S3 logs the non-printable characters are not escaped. This means that when analysing your logs with command-line tools you must keep in mind that non-printable symbols might be interpreted as escape sequences and the events on the screen might not seem as they are written to the file. This might create confusion when looking at the log files. Lets illustrate this by making another request. GET / /public.txt?a=_<08><08><08><08><08><08><08><08><08><08><08><08><08>_semi.txt HTTP/1.1Host: s3.eu-central-1.amazonaws.comUser-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 TESTIME Firefox/61.0 <bucket_name> The character in the request is the for backspace represented in HEX. The file is still being returned as previously. The log event in the log event on the screen is somewhat different <08> ascii symbol Log event for the unescaped backspaces Since the application server didn’t escape the backspace before writing the event to the log file, terminal interpreted this as a command and removed the 13 characters from display. It is important to understand: the removal took place : log file still contains the original text and backslash characters. The terminal removed the characters from your view. Same file with ‘vi’ or hex editor would reveal the truth only on the screen Log event seen in VI The situation could be even more confusing when using ‘grep’. Grepping the unescaped characters As you can see, I grepped the parameter name which will be deleted when displayed on the screen. Imagine when you stumble upon these kind of events in the middle of an incident: grepping something that is not actually there (e.g. usernames or other user input values). Backspace.. And that‘s it? Well, not quite. Poisoning log files with ASCII characters is not the only possibility to ruin command-line log analysis. Another approach can be with . You’ve probably used these escape sequences and colour codes to create your fancy CLI screens for your terminal. The same approach can be applied here as well. Imagine a request like the following: ANSI escape sequences GET / /public.txt HTTP/1.1Host: s3.eu-central-1.amazonaws.comUser-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 TESTIME Firefox/61.0** [0;49m** <bucket_name> < >[31;49m 1b <1b> See the escape codes before and after the User-Agent, where is the HEX representation of the escape code. When grepping your logs you would see the following output on the screen <1b> Coloured events Well, whoopty doo: colouring the prompt. And that’s it? Well, not quite. ANSI escape sequences allow you to do much more then just colouring your characters in the terminal. The codes in the log file can reposition your cursor on the screen, reconfigure your terminal settings and do lots of other “fun” stuff with it. Coming back to log evasion, consider the following request: GET / /public.txt HTTP/1.1Host: s3.eu-central-1.amazonaws.comUser-Agent: < >[21D401_<1b>_[18CMozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 TESTIME Firefox/61.0 <bucket_name> 1b The escape sequence means, that “ ”. Or in other words: change the response code. move left 21 places, write 401 and move right 18 places Response changed with escape sequence When just browsing your logs, you might find yourself satisfied with the result, that all of the requests have been answered with . Imagine the surprise when an adversary tells you that he actually saw the file. HTTP-40x Or if an adversary would append the User-Agent header value with an escape sequence (“ ), the cursor would be pointing to the beginning of your log event thus over-writing the event with the next one. Or in other words: your log event would be hidden from sight when using GNU command-line tools like ‘ ’ or ‘ ’. These are just a few examples to show what to consider when analysing logs. <1b>[2A move cursor up 2 lines” cat more What should I do now? Input validation is always important! Not only when writing an application but also when logging your applications behaviour. In the examples I brought it’s mainly a matter of taste if you do your validation when writing down the log event or when loading the log events to your analysis environment. Just keep in mind that you have to do it somewhere and design the remaining part of your log analysis environment respectively. There are pros and cons with both approaches, e.g. it’s useful to understand when someone is trying to poison your logs with unexpected input. How ever — instead of saving the data in original format, you might consider escaping the non-printable symbols not whitelisting or ignoring them. It would also be beneficial to search for such non-printable symbols from your logs from time-to-time to see if someone is trying to evade your logging system.

Amazon

Apache

Intel

Log evasion: Log me if You can!

About Author

Comments

TOPICS

THIS ARTICLE WAS FEATURED IN

Related Stories

Android: Location Tracking with a Service

Keep Your Logs Close And Your Code Closer

101 Stories To Learn About Cloud Infrastructure

10 Things in Engineering We Don't Spend Enough Time On

10 Things I Did To Increase CloudTrail Logs Security

10 reasons to give cloud computing a go

Android: Location Tracking with a Service

Keep Your Logs Close And Your Code Closer

101 Stories To Learn About Cloud Infrastructure

10 Things in Engineering We Don't Spend Enough Time On

10 Things I Did To Increase CloudTrail Logs Security

10 reasons to give cloud computing a go

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps