From: cristina
Date: Wed, 2 Jul 2008 15:31:17 -0700 (PDT)
Local: Thurs, Jul 3 2008 8:31 am
Subject: Re: python sitemap_gen.py MemoryError
Can you run the sitemap generator more than once
for different config.xml files with different settings for the <directory> node, just to break the sitemaps for different sub-folders, to check if indeed the problem is memory leak because of the large number of URLs, and not some problem because of file system walking. For example first time run the sitemap generator
<directory
change default_file to index.shtml
It is not great, just to check that the
Cristina.
On Jul 2, 9:39 pm, BadXAsh wrote:
> Man I was very confident in that working. I Changed the verbose
> attribute of the site nod in config.xml to 3, and it does say > something about the directory being walked at the very beginning of > the process, but then around sitemap 54 I received this message: > --- > URL: loc=[http://www.diyautoparts.com/search/parts/1985/dodge/aries/ > air-check-valve.shtml] lastmod=[2008-01-17T16:25:38Z] changefreq=[] > priority=[] > URL: loc=[http://www.diyautoparts.com/search/parts/1985/dodge/aries/ > air-conditioning-accumulator.shtml] lastmod=[2008-01-17T16:25:38Z] > changefreq=[] priority=[] > Traceback (most recent call last): > File "sitemap_gen.py", line 2206, in ? > sitemap.Generate() > File "sitemap_gen.py", line 1778, in Generate > input.ProduceURLs(self.ConsumeURL) > File "sitemap_gen.py", line 979, in ProduceURLs > os.path.walk(self._path, PerDirectory, None) > File "/usr/lib/python2.4/posixpath.py", line 298, in walk > walk(name, func, arg) > File "/usr/lib/python2.4/posixpath.py", line 298, in walk > walk(name, func, arg) > File "/usr/lib/python2.4/posixpath.py", line 298, in walk > walk(name, func, arg) > File "/usr/lib/python2.4/posixpath.py", line 298, in walk > walk(name, func, arg) > File "/usr/lib/python2.4/posixpath.py", line 298, in walk > walk(name, func, arg) > File "/usr/lib/python2.4/posixpath.py", line 290, in walk > func(arg, top, names) > File "sitemap_gen.py", line 974, in PerDirectory > PerFile(dirpath, name) > File "sitemap_gen.py", line 959, in PerFile > consumer(url, False) > File "sitemap_gen.py", line 1839, in ConsumeURL > self._urls[hash] = 1 > MemoryError > --- > I am using the directory node this is what i have:
> ---
> (My Site Path) Is my long path that i actually have typed in but I'm
> And yes those pages are all different one for each type of part we
> Thanks for your help so far!
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
| ||||||||||||||