Regular expressions are a powerful tool for working with text data in Python. A regular expression is a sequence of characters that define a search pattern, which can be used for pattern-matching, replacing, or extracting specific information from a text. The re
module of Python provides support for regular expressions.
In this article, I have added some simple examples and exercises to demonstrate the use of regular expressions in Python. Check out these examples to get a clear idea of how regular expressions work.
Let’s jump right in.
1. Importing the re module
import re
2. Basic pattern matching using search()
import re
pattern = "Python"
text = "I write code in Python."
match = re.search(pattern, text)
print("Match found:", bool(match))
Output:
data:image/s3,"s3://crabby-images/bc19f/bc19fe175101b7fc9a416e8187c14958b9275fec" alt=""
3. Using findall() to find all occurrences of a pattern
import re
pattern = "a"
text = "I love to code in Python and it's amazing!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/6e2e4/6e2e4461263e52244526bd6affb6315e4f9b1f23" alt=""
4. Using split() to split a string based on a pattern
import re
pattern = "\s"
text = "I like Python and it is amazing!"
split_text = re.split(pattern, text)
print("Split text:", split_text)
Output:
data:image/s3,"s3://crabby-images/9a6ee/9a6eeba613ccdc3693e5468a053e5f0d34f39afc" alt=""
5. Using sub() to replace a pattern with a string
import re
pattern = "Python"
replacement = "Java"
text = "Python is fun"
substituted_text = re.sub(pattern, replacement, text)
print("Substituted text:", substituted_text)
Output:
data:image/s3,"s3://crabby-images/eb439/eb439ea0cc5aa89527762841222a79a2b5b2cb0b" alt=""
6. Using ^ to match the start of a string
import re
pattern = "^I"
text = "I love Python!"
match = re.search(pattern, text)
print("Match found:", bool(match))
Output:
data:image/s3,"s3://crabby-images/c034f/c034ffd224d15084ed5d6ea39326a4882fbc80cf" alt=""
7. Using $ to match the end of a string
import re
pattern = "Python!$"
text = "Python programming is great!"
match = re.search(pattern, text)
print("Match found:", bool(match))
Output:
data:image/s3,"s3://crabby-images/95038/95038e81764188a6f4fcea5d379f5e2663d2d3c0" alt=""
8. Using [] to create a character set
import re
pattern = "[Pp]ython"
text = "I write code in python and Python!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/03b4b/03b4bfd67fa83ef8468242e13357ce0c6e33cdde" alt=""
9. Using . to match any character
import re
pattern = "Py...n"
text = "I love Python, Pyt3on, Py45n, and Py@#n!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/204ab/204ab00a005da859a087e8b6ee8bbf02da5fb293" alt=""
10. Using * to match zero or more occurrences of a character
import re
pattern = "Py.*n"
count = 0
text = ["Python coding", "Pyt3on","Java", "Py45n", "Py@#n","Pyn"]
for i in text:
if(re.findall(pattern, i)):
count+=1
print("Matches found:", count)
Output:
data:image/s3,"s3://crabby-images/835b2/835b21a855d04d6c4916545cdee16d073145cd7a" alt=""
11. Using + to match one or more occurrences of a character
import re
pattern = "Py.+n"
count = 0
text = ["Python coding", "Pyt3on","Java", "Py45n", "Py@#n","Pyn"]
for i in text:
if(re.findall(pattern, i)):
count+=1
print("Matches found:", count)
data:image/s3,"s3://crabby-images/ad5b9/ad5b9828cab405a13685c8e6fe32817b9b4717e1" alt=""
12. Using ? to match zero or one occurrence of a character
import re
pattern = "Py.?n"
count = 0
text = ["Python coding", "Pyt3on","Java", "Py45n", "Py@#n","Pyn"]
for i in text:
if(re.findall(pattern, i)):
count+=1
print("Matches found:", count)
Output:
data:image/s3,"s3://crabby-images/84749/847495c9c5e1991ee1eef0d36236963fb129ca80" alt=""
13. Using {} to specify the number of occurrences of a character
import re
pattern = "Py{1,2}n"
text = "I love Pyn, Pyyn, and Pyyyn!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/083b1/083b12ef60378127b07f4f67c69ee6504c7f35dc" alt=""
14. Using \d to match digits
import re
pattern = "\d+"
text = "My phone number is 123-456-7890."
matches = re.findall(pattern, text)
print(matches)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/557e4/557e4cb4a01c1810fa9fa70f9288539a6d467ece" alt=""
15. Using \w to match alphanumeric characters
import re
pattern = "\w+"
text = "I_love_Python!"
matches = re.findall(pattern, text)
print(matches)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/cc471/cc4710bb68b6cbb967131a0ca3d43501656f5500" alt=""
16. Using \s to match whitespace characters
import re
pattern = "\s+"
text = "I love Python!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/3bf48/3bf482085af1ccd32e91d6e267dda65d8ba81439" alt=""
17. Using | as an OR operator
import re
pattern = "Python|Java"
text = "I write code in Python and Java!"
matches = re.findall(pattern, text)
print("Matches found:", len(matches))
Output:
data:image/s3,"s3://crabby-images/3bf48/3bf482085af1ccd32e91d6e267dda65d8ba81439" alt=""
18. Using () to group patterns
import re
pattern = "(\d{3})-(\d{3})-(\d{4})"
text = "My phone number is 123-456-7890."
match = re.search(pattern, text)
if match:
print("Area code:", match.group(1))
print("Local exchange:", match.group(2))
print("Line number:", match.group(3))
Output:
data:image/s3,"s3://crabby-images/2f405/2f405e53a631a6bc340625a8144cb4f65713a89b" alt=""
I hope this article was helpful. Check out my post on 15 Python String Manipulation Exercises.
Great exercises.
Thank you for sharing this.