scrapy

mirror of https://github.com/scrapy/scrapy.git synced 2025-03-03 18:17:51 +00:00

Author	SHA1	Message	Date
Pablo Hoffman	c6435c5aa7	restored scrapyd log message	2012-08-31 18:49:20 -03:00
Daniel Graña	dcef7b03c1	format log lines lazily in case they are dropped by loglevels	2012-08-31 17:14:35 -03:00
Pablo Hoffman	94b40162a9	fixed tests to work on windows	2012-08-30 11:24:29 -03:00
Pablo Hoffman	b33303779a	scrapyd.launcher: make SCRAPY_LOG_FILE and SCRAPY_FEED_URI optional	2012-05-21 14:29:15 -03:00
Pablo Hoffman	e521da2e2f	Dropped support for Python 2.5. See: http://blog.scrapy.org/scrapy-dropping-support-for-python-25	2012-03-01 08:18:12 -02:00
Pablo Hoffman	dbda33efa6	scrapyd: added support for storing items by default Items are stored the same way as logs, in jsonlines format. Also renamed logs_to_keep setting to jobs_to_keep.	2012-01-03 23:08:54 -02:00
Pablo Hoffman	485bc180df	scrapyd: improved web interface to also show pending and finished jobs	2012-01-03 23:02:25 -02:00
Pablo Hoffman	9064188035	removed unused import	2011-12-28 15:21:10 -02:00
Pablo Hoffman	150f82e600	some some changes to scrapyd listjobs.json api: * the api is now a GET instead of POST (for consistency) * the api also returns pending and finished jobs, in addition to running ones * only the last 100 finished jobs are kept (can be changed through the finished_to_keep setting)	2011-12-28 15:17:52 -02:00
Pablo Hoffman	fa644f7a5e	Some simplifications to Scrapyd architecture and internals: - launcher no longer knows about egg storage - removed get_spider_list_from_eggifile() file and replaced by simpler get_spider_list() which doesn't receive en egg file as argument - changed "egg runner" name to just "runner" to reflect the fact that it doesn't necesarilly run eggs (though it does in the default case) --HG-- rename : scrapyd/eggrunner.py => scrapyd/runner.py	2010-12-27 16:22:32 -02:00
Pablo Hoffman	85890a5092	scrapyd: log process logfile when process starts/finishes	2010-11-30 15:45:42 -02:00
Pablo Hoffman	5c4f562ec4	scrapyd: changed keys used in poller message to _project, _spider, _job, and added link to log file in web ui	2010-11-30 13:03:20 -02:00
Pablo Hoffman	df54ed0041	Some Scrapyd enhancements: * added minimal web ui * return unique id per job (spider scheduled) * store one log per spider run (job) and rotate them, keeping the last N logs (where N is configurable through settings)	2010-11-30 02:26:31 -02:00
Pablo Hoffman	bbffa59497	Some changes to Scrapyd: * Always start one process per spider * Added max_proc_per_cpu option (defaults to 4) * Return the number of spiders (instead of a list of them) in schedule.json	2010-11-29 17:19:05 -02:00
Pablo Hoffman	4cecbcdc5b	Fixed bug in Scrapyd launcher when running projects without eggs. Refs #238	2010-09-15 21:03:43 -03:00
Pablo Hoffman	833baa6041	Support running projects without eggs in Scrapyd. Closes #238	2010-09-14 01:44:25 -03:00
Pablo Hoffman	9158e9d682	Some changes to Scrapyd to support multiple configuration files, to make it easier to deploy Scrapyd applications. Also documented 'egg_runner' and 'application' options --HG-- rename : debian/scrapyd.cfg => debian/000-default rename : scrapyd/default_scrapyd.cfg => scrapyd/default_scrapyd.conf	2010-09-07 09:17:25 -03:00
Pablo Hoffman	37e9c5d78e	Added new Scrapy service with support for: * multiple projects * uploading scrapy projects as Python eggs * scheduling spiders using a JSON API Documentation is added along with the code. Closes #218. --HG-- rename : debian/scrapy-service.default => debian/scrapyd.default rename : debian/scrapy-service.dirs => debian/scrapyd.dirs rename : debian/scrapy-service.install => debian/scrapyd.install rename : debian/scrapy-service.lintian-overrides => debian/scrapyd.lintian-overrides rename : debian/scrapy-service.postinst => debian/scrapyd.postinst rename : debian/scrapy-service.postrm => debian/scrapyd.postrm rename : debian/scrapy-service.upstart => debian/scrapyd.upstart rename : extras/scrapy.tac => extras/scrapyd.tac	2010-09-03 15:54:42 -03:00

18 Commits