Ready to Start Your Career?

By: H5p
April 7, 2017
REGEX & PYTHON

By: H5p
April 7, 2017
[/{REGEX} + /{PYTHON}]
Regex or regular expressions can be used to identify a particular pattern in a large dump of data. This comes handy when there is a requirement to extract/identify a particular pattern of data from a heap of data.
USAGE:
- Log files contain a large amount of data which is not very eye friendly. If we exactly know what we are searching for, regex can help.
- Post enumeration: picking out emails or phone numbers from a large file in case there are any present
I will be taking a small example on how to identify and extract the phone number from a particular piece of data. Will keep the example straightforward so that it is easy to digest for beginners.
QUESTION 1: what we are lo0king for?
- Phone numbers (landline)
QUESTION 2: What is the pattern?
- 011-99999999 (country code - 8 digit landline number)
QUESTION 3: What should be the regex look like?
- 'd{3}-d{8}' -> 3 digits followed by dash and then again 3 digits
Let's bake the code:
C0de fl0w:
- Import re module for regex
- Enter the data string. (string starts with what we are searching for)
- Enter the second data string. (data to be searched in the middle of the string)
- Compile the regex with re.compile function
- Use of match function on data to get the data. (starting of string has the phone number pattern)
- Use of search function to get the data. (will search for only the first occurrence of the string)
- Use of findall function to get the data. (finds all patterns in the string)
