Pattern Matching Lab

Infosec Learning
Virtual Lab

Pattern matching is used throughout IT for a multitude of reasons and can be used to identify transit or at rest data. Pattern matching can lead to the classification of data, which can cause problems, as in the case of a false positive. Where you have wrongly classified data based on a particular pattern but the data belongs to another class. In t...

Time
1 hour 30 minutes
Difficulty
Beginner
Share
Join over 3 million cybersecurity professionals advancing their career
Sign up with
or

Already have an account? Sign In »

Overview

Pattern matching is used throughout IT for a multitude of reasons and can be used to identify transit or at rest data. Pattern matching can lead to the classification of data, which can cause problems, as in the case of a false positive. Where you have wrongly classified data based on a particular pattern but the data belongs to another class. In this lab, we are going to use the kddcup.data.corrected dataset, which contains traffic flows captured by TCPDUMP. We will use the Python programming language to detect patterns in the said dataset. We will illustrate how a false positive can occur, and we will also demonstrate how a Python program can be used to detect this error. Lastly, we will correct the error by altering the pattern within our program and verify that there are no more false positives.