Error while parsing xml in Python

Question

I'm trying to parse this: http://www.codespot.blogspot.in/atom.xml?redirect=false&start-index=1&max-results=500

The problem:

I have to store the XML in a file before ElementTree can parse it. How can I avoid that and parse the string response from the GET request directly?

Even with the file-based approach below, which is meant to print all the titles, it still doesn't work:

f = open('output.xml','wb+')
f.write(r.content)
f.close()
tree = ""
with open('output.xml', 'rt') as f:
    tree = ElementTree.parse(f)
    print tree
    root = tree.getroot()
    for elem in tree.iter():
        print elem.tag, elem.attrib
    for atype in tree.findall('title'):
        print atype.contents

To parse a string rather than a file, use ElementTree.fromstring(string), but you don't need to do that here. namit found the correct namespace usage before me. :)
– Lennart Regebro, Commented Apr 17, 2013 at 5:11
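
As a rough illustration of the ElementTree.fromstring route mentioned in the comment, here is a minimal sketch. It is written for Python 3 (the code in this thread is Python 2), and SAMPLE_FEED and entry_titles are hypothetical names: SAMPLE_FEED is a tiny hand-written Atom document standing in for the bytes of the GET response (r.content in the question), not the real feed.

```python
import xml.etree.ElementTree as ET

# Namespace prefix in ElementTree's "{uri}tag" notation.
ATOM_NS = "{http://www.w3.org/2005/Atom}"

# Hypothetical stand-in for the response body (r.content in the question).
SAMPLE_FEED = b"""<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Example blog</title>
  <entry><title>First post</title><content type="html">Hello</content></entry>
  <entry><title>Second post</title><content type="html">World</content></entry>
</feed>"""


def entry_titles(xml_bytes):
    """Return the text of every entry title in an Atom document."""
    # fromstring returns the root Element directly, not an ElementTree.
    root = ET.fromstring(xml_bytes)
    path = ATOM_NS + "entry/" + ATOM_NS + "title"
    return [title.text for title in root.findall(path)]


print(entry_titles(SAMPLE_FEED))
```

No temporary file is involved: the bytes go straight into the parser. The unqualified findall('title') from the question would match nothing here, because every element in the feed belongs to the Atom namespace.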

namit · Accepted Answer · 2013-04-17 05:18:26Z

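import urllib2
from xml.etree import cElementTree as ET

# ET.parse accepts any file-like object, so the HTTP response can be parsed directly
conn = urllib2.urlopen("http://www.codespot.blogspot.in/atom.xml?redirect=false&start-index=1&max-results=500")
myins = ET.parse(conn)
# Atom elements live in a namespace, so every tag in the path must be qualified with it
for elem in myins.findall('{http://www.w3.org/2005/Atom}entry/{http://www.w3.org/2005/Atom}title'):
    print elem.text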

Or, to get both the title and the content:

for elem in myins.findall('{http://www.w3.org/2005/Atom}entry'):
    print elem.find('{http://www.w3.org/2005/Atom}title').text  # the title
    print elem.find('{http://www.w3.org/2005/Atom}content').text  # the content
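
The repeated namespace URI can be shortened with a prefix mapping, which findall and findtext accept through their namespaces argument. The following is a sketch under assumptions, not the answer's own code: it targets Python 3, and NS, SAMPLE_FEED and titles_and_contents are hypothetical names, with SAMPLE_FEED a small hand-written Atom document rather than the real feed.

```python
import xml.etree.ElementTree as ET

# The prefix "atom" is an arbitrary local choice; only the URI has to match.
NS = {"atom": "http://www.w3.org/2005/Atom"}

# Hypothetical sample data; the last entry deliberately has no <content>.
SAMPLE_FEED = b"""<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <entry><title>First post</title><content type="html">Hello</content></entry>
  <entry><title>Second post</title><content type="html">World</content></entry>
  <entry><title>Third post</title></entry>
</feed>"""


def titles_and_contents(xml_bytes):
    """Return a (title, content) tuple for each entry in an Atom document."""
    root = ET.fromstring(xml_bytes)
    pairs = []
    for entry in root.findall("atom:entry", NS):
        # findtext falls back to the default when the child element is missing,
        # whereas entry.find(...).text would raise AttributeError on None.
        title = entry.findtext("atom:title", default="", namespaces=NS)
        content = entry.findtext("atom:content", default="", namespaces=NS)
        pairs.append((title, content))
    return pairs


for title, content in titles_and_contents(SAMPLE_FEED):
    print(title, "->", content)
```

Using findtext with a default is a design choice for feeds where some entries may lack a content element; if every entry is known to have one, find(...).text as in the answer works just as well.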

edited Apr 17, 2013 at 5:18

answered Apr 17, 2013 at 5:08

namit

