Regex to replace url with a word python

Question

I am trying to replace few urls in a long string.

A sample here:

s = 'https://www.yellowpages.ca/bus/Alberta/Edmonton/MNS-Enterprise-\nLtd/8114324.html, https://411.ca/business/profile/13300641'

Due to the newline character within the url, the match will always stop at \n. I tried

re.sub(r'(https?://[\S]*)', 'website__', s, re.DOTALL)

but the result breaks at \n

'website__\nLtd/8114324.html, website__'

Wiktor Stribiżew · Accepted Answer · 2021-05-13 20:47:23Z

1

You can add \n and use

re.sub(r'https?://[\n\S]+\b', '<URL>', s)

See the regex demo. Details:

https?:// - http:// or https://
[\n\S]+ - one or more newline or non-whitespace chars
\b - until the rightmost word boundary.

See the Python demo:

import re
s = 'https://www.yellowpages.ca/bus/Alberta/Edmonton/MNS-Enterprise-\nLtd/8114324.html, https://411.ca/business/profile/13300641'
print( re.sub(r'https?://[\n\S]+\b', 'website__', s) )
# => website__, website__

answered May 13, 2021 at 20:47

Wiktor Stribiżew

631k41 gold badges502 silver badges632 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Regex to replace url with a word python

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related