From cxcxcxcxcx at gmail.com Tue Apr 14 13:22:41 2009
From: cxcxcxcxcx at gmail.com (nicx)
Date: Tue, 14 Apr 2009 06:22:41 +0300
Subject: cyrillic characters unavailable in config.ini
Message-ID:

Hello,

Some of the subscribed feeds on my planet have Cyrillic names (written with Cyrillic characters in the original), but after I define them in config.ini they display as unknown characters on the site. Obviously the original config.ini is not in UTF-8 format. However, when I save it as a UTF-8 file, I get an error when rebuilding the planet (python.py ... ...).

Any idea what I'm talking about? :)

Thanks in advance,
Regards

From vasil at ludost.net Tue Apr 14 18:16:24 2009
From: vasil at ludost.net (Vasil Kolev)
Date: Tue, 14 Apr 2009 11:16:24 +0300
Subject: cyrillic characters unavailable in config.ini
In-Reply-To:
References:
Message-ID: <1239696984.6712.1.camel@shrike.home.ludost.net>

At 06:22 +0300 on 14.04.2009 (Tue), nicx wrote:
> Hello,
>
> Some of the subscribed feeds on my planet have Cyrillic names (written with Cyrillic characters in the original), but after I define them in config.ini they display as unknown characters on the site. Obviously the original config.ini is not in UTF-8 format. However, when I save it as a UTF-8 file, I get an error when rebuilding the planet (python.py ... ...).
>
> Any idea what I'm talking about? :)
>

Erm, no? I have an aggregator on http://swamp.ludost.net, and there are no issues with UTF-8 in the config.ini. What's the error you're seeing?

From marcelo at dzestudio.com.br Tue Apr 14 22:45:05 2009
From: marcelo at dzestudio.com.br (Marcelo Oliveira)
Date: Tue, 14 Apr 2009 09:45:05 -0300
Subject: RES: cyrillic characters unavailable in config.ini
Message-ID: <633BB1E4E83CBB48BB5C074EFE398B5106645A@SERVER1.dzestudio.local>

Hi there,

Tell us what error is shown. I had some problems with UTF-8 too, but they are already solved here.

Marcelo Oliveira
DZ Estúdio - Comunicação Online
Rua Ramiro Barcelos, 1056 / 604
CEP 90035-002 - Moinhos de Vento - Porto Alegre RS
Phone: +55 [51] 3268.4313 - www.dzestudio.com.br

-----Original message-----
From: devel-bounces at lists.planetplanet.org [mailto:devel-bounces at lists.planetplanet.org] On behalf of Vasil Kolev
Sent: Tuesday, 14 April 2009 05:16
To: devel at lists.planetplanet.org
Subject: Re: cyrillic characters unavailable in config.ini

At 06:22 +0300 on 14.04.2009 (Tue), nicx wrote:
> Hello,
>
> Some of the subscribed feeds on my planet have Cyrillic names (written with Cyrillic characters in the original), but after I define them in config.ini they display as unknown characters on the site. Obviously the original config.ini is not in UTF-8 format. However, when I save it as a UTF-8 file, I get an error when rebuilding the planet (python.py ... ...).
>
> Any idea what I'm talking about? :)
>

Erm, no? I have an aggregator on http://swamp.ludost.net, and there are no issues with UTF-8 in the config.ini. What's the error you're seeing?

From feeds at greenfeed.org.uk Fri Apr 17 01:54:01 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Thu, 16 Apr 2009 16:54:01 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk>
Message-ID: <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk>

Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?

--
Gordon Hodgson
gordon at gordonhodgson.co.uk

In January 2009, I hiked up Arthur's Seat 31 times in 31 days for the Chest, Heart & Stroke Association Scotland
Please sponsor me here: http://www.justgiving.com/gordonhodgson

On 16 Apr 2009, at 15:58, Gordon Hodgson wrote:

> Other pages have continued to work.
>
> At first I thought it was something screwed up on the server side, but when I deleted all but two feeds from the config.ini it started working again (I will experiment with larger numbers of feeds)
>
> The cron debug logs show the usual feed-checking, but they never get round to the html-generating templates.
>
> I've got no idea why this suddenly happened.
>
> Gordon

From planet at philwilson.org Fri Apr 17 02:08:55 2009
From: planet at philwilson.org (Phil Wilson)
Date: Thu, 16 Apr 2009 17:08:55 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk>
Message-ID: <9159c3dc0904160908r295e81e8jc8db8d1c1988a3bd@mail.gmail.com>

I *think* you could break the config into multiple files yourself and then use the offline mode to generate the venus pages just from the cache.
So run planet.py multiple times, once for each config file (this should populate your cache directory), followed by

python planet.py -o config.ini

where config.ini is an arbitrary one of your config files.

Cheers,

Phil

2009/4/16 Gordon Hodgson:
> Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?
> --
> Gordon Hodgson
> gordon at gordonhodgson.co.uk
>
> In January 2009, I hiked up Arthur's Seat 31 times in 31 days for the Chest, Heart & Stroke Association Scotland
> Please sponsor me here: http://www.justgiving.com/gordonhodgson
>
> On 16 Apr 2009, at 15:58, Gordon Hodgson wrote:
>
>> Other pages have continued to work.
>>
>> At first I thought it was something screwed up on the server side, but when I deleted all but two feeds from the config.ini it started working again (I will experiment with larger numbers of feeds)
>>
>> The cron debug logs show the usual feed-checking, but they never get round to the html-generating templates.
>>
>> I've got no idea why this suddenly happened.
>>
>> Gordon
>
> --
> devel mailing list
> devel at lists.planetplanet.org
> http://lists.planetplanet.org/mailman/listinfo/devel
>

From chakradeo+planet at gmail.com Fri Apr 17 02:42:17 2009
From: chakradeo+planet at gmail.com (Amit Chakradeo (अमित चक्रदेव))
Date: Thu, 16 Apr 2009 09:42:17 -0700
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk>
Message-ID: <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com>

On Thu, Apr 16, 2009 at 8:54 AM, Gordon Hodgson wrote:

> Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?
>

Not a proper solution, but you could try running the spider and splice scripts separately:

python spider.py config.ini

and later

python splice.py config.ini

> --
> Gordon Hodgson
> gordon at gordonhodgson.co.uk
>
> In January 2009, I hiked up Arthur's Seat 31 times in 31 days for the Chest, Heart & Stroke Association Scotland
> Please sponsor me here: http://www.justgiving.com/gordonhodgson
>
> On 16 Apr 2009, at 15:58, Gordon Hodgson wrote:
>
> > Other pages have continued to work.
> >
> > At first I thought it was something screwed up on the server side, but when I deleted all but two feeds from the config.ini it started working again (I will experiment with larger numbers of feeds)
> >
> > The cron debug logs show the usual feed-checking, but they never get round to the html-generating templates.
> >
> > I've got no idea why this suddenly happened.
> >
> > Gordon
>
> --
> devel mailing list
> devel at lists.planetplanet.org
> http://lists.planetplanet.org/mailman/listinfo/devel
>

From rubys at intertwingly.net Fri Apr 17 02:52:22 2009
From: rubys at intertwingly.net (Sam Ruby)
Date: Thu, 16 Apr 2009 12:52:22 -0400
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com>
Message-ID: <49E76246.8030506@intertwingly.net>

Amit Chakradeo (अमित चक्रदेव) wrote:
>
>
> On Thu, Apr 16, 2009 at 8:54 AM, Gordon Hodgson wrote:
>
> Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?
>
> Not a proper solution, but you could try running the spider and splice scripts separately
>
> python spider.py config.ini
>
> and later
> python splice.py config.ini

What's improper about that? :-)

If a config.ini is split up into smaller pieces, running spider on each of the smaller lists will populate the cache.

A separate, master, list could simply include each of the subconfigs:

http://intertwingly.net/code/venus/tests/data/config/rlist-config.ini

Running splice on the master list will pull from everything in the cache.

- Sam Ruby

From feeds at greenfeed.org.uk Fri Apr 17 03:13:26 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Thu, 16 Apr 2009 18:13:26 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <49E76246.8030506@intertwingly.net>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com> <49E76246.8030506@intertwingly.net>
Message-ID:

What I've done is cut my config.ini in half into two files called config1.ini and config2.ini (the scripts were terminating before the spidering was complete)

Then I've run spider config1.ini
And then 5 minutes later spider config2.ini
And then 5 minutes later splice config.ini

Does this make sense?

I'm not completely sure about how much of the file info can be removed (for instance, whether the avatar stuff and most of the [planet] section need to be in the spidered files; I've cut them out, and everything seems to work fine). Can anything be cut out from the master config.ini (spliced) file? I haven't been able to work all this stuff out too easily from the Venus documentation website (and I don't understand Python).

Many thanks,

Gordon

On 16 Apr 2009, at 17:52, Sam Ruby wrote:

> Amit Chakradeo (अमित चक्रदेव)
wrote:
>> On Thu, Apr 16, 2009 at 8:54 AM, Gordon Hodgson wrote:
>> Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?
>> Not a proper solution, but you could try running the spider and splice scripts separately
>> python spider.py config.ini
>> and later python splice.py config.ini
>
> What's improper about that? :-)
>
> If a config.ini is split up into smaller pieces, running spider on each of the smaller lists will populate the cache.
>
> A separate, master, list could simply include each of the subconfigs:
>
> http://intertwingly.net/code/venus/tests/data/config/rlist-config.ini
>
> Running splice on the master list will pull from everything in the cache.
>
> - Sam Ruby
>

From chakradeo+planet at gmail.com Fri Apr 17 03:23:53 2009
From: chakradeo+planet at gmail.com (Amit Chakradeo (अमित चक्रदेव))
Date: Thu, 16 Apr 2009 10:23:53 -0700
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To:
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com> <49E76246.8030506@intertwingly.net>
Message-ID: <51e033810904161023g3f522f26i94ae79697286e71@mail.gmail.com>

On Thu, Apr 16, 2009 at 10:13 AM, Gordon Hodgson wrote:

> What I've done is cut my config.ini in half into two files called config1.ini and config2.ini (the scripts were terminating before the spidering was complete)
>
> Then I've run spider config1.ini
> And then 5 minutes later spider config2.ini
> And then 5 minutes later splice config.ini
>
> Does this make sense?
>

It does. But it still depends on how many feeds each config file has and how much time the final splice takes. 60 seconds is quite restrictive.
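
The sequence quoted above (spider each sub-config, then splice the master config) can be sketched as a small driver. This is only an illustration: the file names config1.ini, config2.ini and config.ini come from Gordon's message, the script names spider.py and splice.py from earlier in the thread, and the helpers build_commands and run_all are hypothetical. The thread does not settle whether the host's 60-second limit applies per process, so a single driver like this may need to be split into separate cron jobs, as Gordon did.

```python
import subprocess
import sys


def build_commands(sub_configs, master_config, python=sys.executable):
    """Return the command lines: one spider run per sub-config, then one splice."""
    commands = [[python, "spider.py", cfg] for cfg in sub_configs]
    commands.append([python, "splice.py", master_config])
    return commands


def run_all(commands):
    """Run each command in order and stop at the first non-zero exit code."""
    for argv in commands:
        code = subprocess.call(argv)
        if code != 0:
            return code
    return 0


# Hypothetical usage, started from the Venus directory:
# run_all(build_commands(["config1.ini", "config2.ini"], "config.ini"))
```

Stopping at the first failure keeps splice from running against a half-populated cache; whether that is the desired behaviour is a design choice, not something the thread prescribes.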
Maybe you can just run the planet on your local machine and scp/FTP the generated files to the web server?

>
> I'm not completely sure about how much of the file info can be removed (for instance, whether the avatar stuff and most of the [planet] section need to be in the spidered files; I've cut them out, and everything seems to work fine). Can anything be cut out from the master config.ini (spliced) file? I haven't been able to work all this stuff out too easily from the Venus documentation website (and I don't understand Python).
>

Take a look at the architecture diagram: http://intertwingly.net/code/venus/docs/venus.svg
It looks like splice.py would need the avatar stuff etc. BTW you can leave all the stuff in the config file; parsing the config file takes a minuscule amount of time compared with the rest of the things the scripts have to do...

--Amit

From feeds at greenfeed.org.uk Fri Apr 17 03:37:35 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Thu, 16 Apr 2009 18:37:35 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <51e033810904161023g3f522f26i94ae79697286e71@mail.gmail.com>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com> <49E76246.8030506@intertwingly.net> <51e033810904161023g3f522f26i94ae79697286e71@mail.gmail.com>
Message-ID: <8FFE2DBD-7772-4283-9145-35D990856E77@greenfeed.org.uk>

On 16 Apr 2009, at 18:23, Amit Chakradeo (अमित चक्रदेव)
wrote:

> On Thu, Apr 16, 2009 at 10:13 AM, Gordon Hodgson wrote:
> What I've done is cut my config.ini in half into two files called config1.ini and config2.ini (the scripts were terminating before the spidering was complete)
>
> Then I've run spider config1.ini
> And then 5 minutes later spider config2.ini
> And then 5 minutes later splice config.ini
>
> Does this make sense?
>
>
> It does. But it still depends on how many feeds each config file has and how much time the final splice takes. 60 seconds is quite restrictive.

Yes, the host has limited scripts to 60 seconds, and says that the longest recorded time he could find for me running the planet.py script is 80 seconds... now I have to run three scripts. Surely that will take more than 60 seconds of CPU time...?

> Maybe you can just run the planet on your local machine and scp/FTP the generated files to the web server?

Not ideal. I don't like to leave my machines turned on more than I have to.

> I'm not completely sure about how much of the file info can be removed (for instance, whether the avatar stuff and most of the [planet] section need to be in the spidered files; I've cut them out, and everything seems to work fine). Can anything be cut out from the master config.ini (spliced) file? I haven't been able to work all this stuff out too easily from the Venus documentation website (and I don't understand Python).
>
> Take a look at the architecture diagram: http://intertwingly.net/code/venus/docs/venus.svg
> It looks like splice.py would need the avatar stuff etc. BTW you can leave all the stuff in the config file; parsing the config file takes a minuscule amount of time compared with the rest of the things the scripts have to do...

Yeah, that sort of makes sense to me...

On a slight tangent, is there any way the cron emails can report if a script finished early?
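
One possible way to make an early kill visible in the cron mail is to wrap the command so that a final line is printed only when it returns. This is a sketch under the assumption that cron mails whatever the job prints; run_with_marker and the START/END wording are hypothetical and not part of Venus.

```python
import subprocess
import sys
import time


def run_with_marker(argv, out=sys.stdout):
    """Run argv and bracket its output with START and END lines."""
    start = time.time()
    out.write("START %s\n" % " ".join(argv))
    out.flush()
    code = subprocess.call(argv)
    # This line is only reached if the wrapper itself was not killed.
    out.write("END exit=%d elapsed=%.1fs\n" % (code, time.time() - start))
    out.flush()
    return code


# Hypothetical usage from a cron job:
# run_with_marker([sys.executable, "spider.py", "config1.ini"])
```

If the END line is missing from the mail, the wrapper was killed along with the script. On POSIX systems a negative exit value means the child was ended by a signal.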

I can only tell now because occasionally one of the lines of the email is truncated (but sometimes it isn't).

I've also reduced the time-out to 10 seconds. Is this reasonable? I'm not even spidering a huge number of feeds... Something like 113...

> --Amit

Cheers for your help so far,

Gordon

From feeds at greenfeed.org.uk Fri Apr 17 00:58:05 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Thu, 16 Apr 2009 15:58:05 +0100
Subject: Venus: My Homepage has suddently stopped updating
Message-ID: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk>

Other pages have continued to work.

At first I thought it was something screwed up on the server side, but when I deleted all but two feeds from the config.ini it started working again (I will experiment with larger numbers of feeds)

The cron debug logs show the usual feed-checking, but they never get round to the html-generating templates.

I've got no idea why this suddenly happened.

Gordon

From feeds at greenfeed.org.uk Fri Apr 17 04:41:36 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Thu, 16 Apr 2009 19:41:36 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <8FFE2DBD-7772-4283-9145-35D990856E77@greenfeed.org.uk>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com> <49E76246.8030506@intertwingly.net> <51e033810904161023g3f522f26i94ae79697286e71@mail.gmail.com> <8FFE2DBD-7772-4283-9145-35D990856E77@greenfeed.org.uk>
Message-ID:

Ok, so I found that taking the avatar thing out of the spidered files seemed to stop the default avatar from working for some (but not all) of my feeds... Don't know why.

Am I correct in thinking the log levels make a hierarchy like this
DEBUG
INFO
WARNING
ERROR
CRITICAL

so DEBUG gives all message levels downwards, INFO gives all except DEBUG, and so on?

Gordon

On 16 Apr 2009, at 18:37, Gordon Hodgson wrote:

>
> On 16 Apr 2009, at 18:23, Amit Chakradeo (अमित चक्रदेव) wrote:
>
>> On Thu, Apr 16, 2009 at 10:13 AM, Gordon Hodgson wrote:
>> What I've done is cut my config.ini in half into two files called config1.ini and config2.ini (the scripts were terminating before the spidering was complete)
>>
>> Then I've run spider config1.ini
>> And then 5 minutes later spider config2.ini
>> And then 5 minutes later splice config.ini
>>
>> Does this make sense?
>>
>>
>> It does. But it still depends on how many feeds each config file has and how much time the final splice takes. 60 seconds is quite restrictive.
>
> Yes, the host has limited scripts to 60 seconds, and says that the longest recorded time he could find for me running the planet.py script is 80 seconds... now I have to run three scripts. Surely that will take more than 60 seconds of CPU time...?
>
>
>> Maybe you can just run the planet on your local machine and scp/FTP the generated files to the web server?
>
> Not ideal. I don't like to leave my machines turned on more than I have to.
>
>
>> I'm not completely sure about how much of the file info can be removed (for instance, whether the avatar stuff and most of the [planet] section need to be in the spidered files; I've cut them out, and everything seems to work fine). Can anything be cut out from the master config.ini (spliced) file? I haven't been able to work all this stuff out too easily from the Venus documentation website (and I don't understand Python).
>>
>> Take a look at the architecture diagram: http://intertwingly.net/code/venus/docs/venus.svg
>> It looks like splice.py would need the avatar stuff etc.
>> BTW you can leave all the stuff in the config file; parsing the config file takes a minuscule amount of time compared with the rest of the things the scripts have to do...
>
> Yeah, that sort of makes sense to me...
>
> On a slight tangent, is there any way the cron emails can report if a script finished early?
>
> I can only tell now because occasionally one of the lines of the email is truncated (but sometimes it isn't).
>
> I've also reduced the time-out to 10 seconds. Is this reasonable? I'm not even spidering a huge number of feeds... Something like 113...
>
>
>> --Amit
>
> Cheers for your help so far,
>
> Gordon
> --
> devel mailing list
> devel at lists.planetplanet.org
> http://lists.planetplanet.org/mailman/listinfo/devel

From rubys at intertwingly.net Fri Apr 17 05:05:31 2009
From: rubys at intertwingly.net (Sam Ruby)
Date: Thu, 16 Apr 2009 15:05:31 -0400
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To:
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <51e033810904160942n3daddfdasf39a70b3a683b6a2@mail.gmail.com> <49E76246.8030506@intertwingly.net> <51e033810904161023g3f522f26i94ae79697286e71@mail.gmail.com> <8FFE2DBD-7772-4283-9145-35D990856E77@greenfeed.org.uk>
Message-ID: <49E7817B.20904@intertwingly.net>

Gordon Hodgson wrote:
> Ok, so I found that taking the avatar thing out of the spidered files seemed to stop the default avatar from working for some (but not all) of my feeds... Don't know why.
>
> Am I correct in thinking the log levels make a hierarchy like this
> DEBUG
> INFO
> WARNING
> ERROR
> CRITICAL
>
> so DEBUG gives all message levels downwards, INFO gives all except DEBUG, and so on?

Yes: http://docs.python.org/library/logging.html#logging-levels

> Gordon
>
> On 16 Apr 2009, at 18:37, Gordon Hodgson wrote:
>
>> On 16 Apr 2009, at 18:23, Amit Chakradeo (अमित चक्रदेव) wrote:
>>
>>> On Thu, Apr 16, 2009 at 10:13 AM, Gordon Hodgson wrote:
>>>
>>> What I've done is cut my config.ini in half into two files called config1.ini and config2.ini (the scripts were terminating before the spidering was complete)
>>>
>>> Then I've run spider config1.ini
>>> And then 5 minutes later spider config2.ini
>>> And then 5 minutes later splice config.ini
>>>
>>> Does this make sense?
>>>
>>> It does. But it still depends on how many feeds each config file has and how much time the final splice takes. 60 seconds is quite restrictive.
>>
>> Yes, the host has limited scripts to 60 seconds, and says that the longest recorded time he could find for me running the planet.py script is 80 seconds... now I have to run three scripts. Surely that will take more than 60 seconds of CPU time...?
>>
>>> Maybe you can just run the planet on your local machine and scp/FTP the generated files to the web server?
>>
>> Not ideal. I don't like to leave my machines turned on more than I have to.
>>
>>> I'm not completely sure about how much of the file info can be removed (for instance, whether the avatar stuff and most of the [planet] section need to be in the spidered files; I've cut them out, and everything seems to work fine). Can anything be cut out from the master config.ini (spliced) file? I haven't been able to work all this stuff out too easily from the Venus documentation website (and I don't understand Python).
>>>
>>> Take a look at the architecture diagram: http://intertwingly.net/code/venus/docs/venus.svg
>>> It looks like splice.py would need the avatar stuff etc.
>>> BTW you can leave all the stuff in the config file; parsing the config file takes a minuscule amount of time compared with the rest of the things the scripts have to do...
>>
>> Yeah, that sort of makes sense to me...
>>
>> On a slight tangent, is there any way the cron emails can report if a script finished early?

Unfortunately, it probably works the other way: cron will email you the output, so you can get a report if it runs to completion; getting a report if the script is terminated early would be a bit harder to accomplish...

>> I can only tell now because occasionally one of the lines of the email is truncated (but sometimes it isn't).
>>
>> I've also reduced the time-out to 10 seconds. Is this reasonable? I'm not even spidering a huge number of feeds... Something like 113...

The time-out probably won't help if the limit is CPU seconds. Feeds that aren't updated since the last run won't take much CPU time. Feeds that are updated but that you don't get to because the job is killed early will be processed on the next run, presumably building up a backlog.

>>> --Amit
>>
>> Cheers for your help so far,
>>
>> Gordon
>> --
>> devel mailing list
>> devel at lists.planetplanet.org
>> http://lists.planetplanet.org/mailman/listinfo/devel

- Sam Ruby

From dncarreira at gmail.com Fri Apr 17 18:56:41 2009
From: dncarreira at gmail.com (Duarte Carreira)
Date: Fri, 17 Apr 2009 09:56:41 +0100
Subject: divide into pages
Message-ID: <6717b0ac0904170156xf33a881qddac6efd36fdf915@mail.gmail.com>

Hi.

Is it possible to keep older content, dividing it into multiple pages? So page #1 would show the most recent posts, then pages #2, #3, ... would keep older posts?

Thanks,
Duarte

From feeds at greenfeed.org.uk Mon Apr 20 23:40:43 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Mon, 20 Apr 2009 14:40:43 +0100
Subject: Venus: My Homepage has suddently stopped updating
In-Reply-To: <1240069973.10062.50.camel@kurtz>
References: <17F518DE-3488-4AF5-B0E4-28435DC8C56F@greenfeed.org.uk> <3AC69E86-8B1F-4851-ACAF-D6963E19BF10@greenfeed.org.uk> <1240069973.10062.50.camel@kurtz>
Message-ID: <6DD9DD33-ABE8-498B-88E0-AA981835AC9B@greenfeed.org.uk>

My host doesn't give me SSH...

Gordon

On 18 Apr 2009, at 16:52, Santiago Gala wrote:

> On Thu, 16-04-2009 at 16:54 +0100, Gordon Hodgson wrote:
>> Apparently my host is auto-terminating scripts that take longer than 60 seconds to run. Is there any way to split the config.ini into smaller chunks?
>
> Workaround: It might be that it is only a default ulimit limitation, and you can lift it somehow with the ulimit command. I used to have a host with the same default limit, but something like "ulimit -t 600" might work and do the job for you. See ulimit in "man bash".
>
> I used to have a hosting provider with such default limits
>
> Regards
> Santiago
>
>> --
>> Gordon Hodgson
>> gordon at gordonhodgson.co.uk
>>
>> In January 2009, I hiked up Arthur's Seat 31 times in 31 days for the Chest, Heart & Stroke Association Scotland
>> Please sponsor me here: http://www.justgiving.com/gordonhodgson
>>
>> On 16 Apr 2009, at 15:58, Gordon Hodgson wrote:
>>
>>> Other pages have continued to work.
>>>
>>> At first I thought it was something screwed up on the server side, but when I deleted all but two feeds from the config.ini it started working again (I will experiment with larger numbers of feeds)
>>>
>>> The cron debug logs show the usual feed-checking, but they never get round to the html-generating templates.
>>>
>>> I've got no idea why this suddenly happened.
>>>
>>> Gordon
>>
>

From feeds at greenfeed.org.uk Mon Apr 20 23:40:58 2009
From: feeds at greenfeed.org.uk (Gordon Hodgson)
Date: Mon, 20 Apr 2009 14:40:58 +0100
Subject: divide into pages
In-Reply-To: <6717b0ac0904170156xf33a881qddac6efd36fdf915@mail.gmail.com>
References: <6717b0ac0904170156xf33a881qddac6efd36fdf915@mail.gmail.com>
Message-ID:

I'm interested in this too...

On 17 Apr 2009, at 09:56, Duarte Carreira wrote:

> Hi.
>
> Is it possible to keep older content, dividing it into multiple pages? So page #1 would show the most recent posts, then pages #2, #3, ... would keep older posts?
>
> Thanks,
> Duarte
> --
> devel mailing list
> devel at lists.planetplanet.org
> http://lists.planetplanet.org/mailman/listinfo/devel

From morten at wasab.dk Fri Apr 24 17:14:10 2009
From: morten at wasab.dk (Morten Høybye Frederiksen)
Date: Fri, 24 Apr 2009 09:14:10 +0200
Subject: divide into pages
In-Reply-To:
References: <6717b0ac0904170156xf33a881qddac6efd36fdf915@mail.gmail.com>
Message-ID: <181621f20904240014m1fd1904k15d1fdae510ac101@mail.gmail.com>

Hi,

On Mon, Apr 20, 2009 at 15:40, Gordon Hodgson wrote:
> I'm interested in this too...
> On 17 Apr 2009, at 09:56, Duarte Carreira wrote:
>> Is it possible to keep older content, dividing it into multiple pages? So page #1 would show the most recent posts, then pages #2, #3, ... would keep older posts?

I have implemented and use a solution that is based on WordPress:
http://wordpress.org/extend/plugins/wp-venus/

It hasn't been tested with the latest version(s) of WordPress, though...

Regards,
Morten

From dncarreira at gmail.com Sun Apr 26 00:33:28 2009
From: dncarreira at gmail.com (Duarte Carreira)
Date: Sat, 25 Apr 2009 15:33:28 +0100
Subject: divide into pages
In-Reply-To: <181621f20904240014m1fd1904k15d1fdae510ac101@mail.gmail.com>
References: <6717b0ac0904170156xf33a881qddac6efd36fdf915@mail.gmail.com> <181621f20904240014m1fd1904k15d1fdae510ac101@mail.gmail.com>
Message-ID: <6717b0ac0904250733n68a3f440pf5eebf99dbab32f8@mail.gmail.com>

Morten, that seems interesting. I'll have a look at it.

Duarte

On Fri, Apr 24, 2009 at 8:14 AM, Morten Høybye Frederiksen wrote:

> Hi,
>
> On Mon, Apr 20, 2009 at 15:40, Gordon Hodgson wrote:
> > I'm interested in this too...
> > On 17 Apr 2009, at 09:56, Duarte Carreira wrote:
> >> Is it possible to keep older content, dividing it into multiple pages? So page #1 would show the most recent posts, then pages #2, #3, ... would keep older posts?
> I have implemented and use a solution that is based on WordPress:
> http://wordpress.org/extend/plugins/wp-venus/
>
> It hasn't been tested with the latest version(s) of WordPress, though...
>
>
> Regards,
> Morten
>

From chakradeo+planet at gmail.com Thu Apr 30 10:15:25 2009
From: chakradeo+planet at gmail.com (Amit Chakradeo (अमित चक्रदेव))
Date: Wed, 29 Apr 2009 17:15:25 -0700
Subject: help with venus filters
Message-ID: <51e033810904291715y290ba145xf3a246a4f54a1834@mail.gmail.com>

Hi,

I have a few feeds (Blogger-based) which seem to be double-escaping the content, so I need to "unescape" the HTML (i.e. replace &lt; with < etc.). I am thinking I can use the xml.sax.saxutils.unescape function, and that I can use Venus's filter mechanism to pass in the Atom entry and get back an unescaped Atom entry. But I cannot see how to actually do this.
Should this be similar to the excerpt.py filter in the Venus source code? Also, how can I quickly test this function? Can I just pipe the cached entry to the filter program?

Thanks!
Amit
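
A minimal sketch of such a filter follows, assuming (as the question about piping suggests) that a filter receives one Atom entry on stdin and writes the result to stdout. It uses xml.sax.saxutils.unescape, as proposed above, on html-typed content and summary elements only. The function names unescape_entry and main are hypothetical, and the code is written for Python 3, whereas the Venus release discussed here ran on Python 2, so treat it as illustrative rather than as Venus's own approach.

```python
import sys
from xml.dom import minidom
from xml.sax.saxutils import unescape

ATOM_NS = "http://www.w3.org/2005/Atom"


def unescape_entry(xml_data):
    """Undo one level of escaping in html-typed content/summary elements."""
    doc = minidom.parseString(xml_data)
    for name in ("content", "summary"):
        for node in doc.getElementsByTagNameNS(ATOM_NS, name):
            if node.getAttribute("type") != "html":
                continue  # leave text and xhtml content alone
            node.normalize()  # merge adjacent text nodes first
            for child in node.childNodes:
                if child.nodeType == child.TEXT_NODE:
                    child.data = unescape(child.data)
    return doc.toxml()


def main():
    """Filter entry point: Atom entry in on stdin, fixed entry out on stdout."""
    data = sys.stdin.buffer.read()
    sys.stdout.buffer.write(unescape_entry(data).encode("utf-8"))
```

To try it on its own, one could add a call to main() at the end of the file and pipe a cached entry through it, for example `python unescape_filter.py < entry.xml`, where both file names are placeholders. Applying the filter to every feed would also unescape content that was escaped correctly, so it is probably best limited to the affected feeds.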