Pattern Matching Lab
Pattern matching is used throughout IT for a multitude of reasons and can be used to identify transit or at rest data. Pattern matching can lead to the classification of data, which can cause problems, as in the case of a false positive. Where you have wrongly classified data based on a particular pattern but the data belongs to another class. In t...
Already have an account? Sign In »

Pattern matching is used throughout IT for a multitude of reasons and can be used to identify transit or at rest data. Pattern matching can lead to the classification of data, which can cause problems, as in the case of a false positive. Where you have wrongly classified data based on a particular pattern but the data belongs to another class. In this lab, we are going to use the kddcup.data.corrected dataset, which contains traffic flows captured by TCPDUMP. We will use the Python programming language to detect patterns in the said dataset. We will illustrate how a false positive can occur, and we will also demonstrate how a Python program can be used to detect this error. Lastly, we will correct the error by altering the pattern within our program and verify that there are no more false positives.