Commit Graph

  • 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced orbiter 2008-09-12 11:51:48 +00:00
  • ba76995d2c * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1415 f1ori 2008-09-12 10:54:11 +00:00
  • bea6c13139 * with r5137 robotParser didn't work at all -> fix f1ori 2008-09-12 09:06:38 +00:00
  • 3ded1efe84 kelondroExceptionCounter didn't work lotus 2008-09-11 18:51:47 +00:00
  • ae677e1738 * fix problem in robotparser, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1421&p=9742 f1ori 2008-09-11 18:12:17 +00:00
  • 383d89481e count errors before deleting collection.index lotus 2008-09-10 16:40:20 +00:00
  • 0bb4fbc403 delete corrupted collecion.index on exit for rebuild on next start see http://forum.yacy-websuche.de/viewtopic.php?p=9725#p9725 lotus 2008-09-10 12:55:14 +00:00
  • b68d06a6e8 performance settings based on network's remote crawl speed removed some _pro values from config lotus 2008-09-10 12:52:17 +00:00
  • d60b2b198d proxy fixed 'not modified' http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1419 danielr 2008-09-10 11:06:22 +00:00
  • bd0318ba81 * YaCy only supports gzip-encoding, so remove any other encoding from request * fixes http://www.yacy-forum.org/viewtopic.php?f=2&t=163 f1ori 2008-09-09 14:04:52 +00:00
  • bb5c898441 enhancements to localsearch behavior orbiter 2008-09-09 10:24:42 +00:00
  • 42e2d195ac added hint from http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1294 orbiter 2008-09-08 22:37:58 +00:00
  • 39964e88fa fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1329#p9121 orbiter 2008-09-08 22:06:45 +00:00
  • 3f3673b6e5 extended balancer: - added automatic time delay in case that a large number of urls come from the same domain - added additional time delay in case that an url is a dynamic (CGI) url. This shall cause less IO on targets orbiter 2008-09-08 21:50:37 +00:00
  • 3c6e8d2015 set default ppm when network is switched orbiter 2008-09-08 18:20:05 +00:00
  • 20c2d3c248 fix for bad formatting in CrawlResults orbiter 2008-09-08 13:59:35 +00:00
  • 01d3b2bd36 ahem.. 6PPM, not 10. orbiter 2008-09-08 09:51:08 +00:00
  • 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM orbiter 2008-09-08 09:49:08 +00:00
  • b92105c8b0 do not change auto recrawl scheduler with performance profiles lotus 2008-09-07 13:59:24 +00:00
  • 5ce9a100bb fix(2) for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416 lotus 2008-09-07 13:57:53 +00:00
  • cf29ca19d4 possible fix for POST character encoding http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374 danielr 2008-09-07 13:10:46 +00:00
  • a2eeb6138c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416 danielr 2008-09-07 13:04:17 +00:00
  • d09ddabd09 corrected a design mistake (5-byte hashes not necessary) orbiter 2008-09-04 21:28:00 +00:00
  • c97d0fcee7 modified the domain list export function: - used the new superfast domain list generation from the domain statistics - better interactive behavior orbiter 2008-09-04 20:28:36 +00:00
  • 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet - added 'delete all' button to all results of such a domain statistic output which causes that all urls to this domain are deleted - extended stack cleaner to clean also the statistics: they are not completely destroyed, only the smallest counting domains are removed orbiter 2008-09-04 19:41:57 +00:00
  • 44bc8311af translation fix lotus 2008-09-04 19:26:59 +00:00
  • e5c0b969d6 * save performance profile speed * fix for wrong javastart_priority after first start lotus 2008-09-04 19:12:22 +00:00
  • d7a16c1f30 * added shutdown on search page (this page is shown after clicking the tray icon) * shorter, less technical words for configuration-links lotus 2008-09-04 12:51:05 +00:00
  • 80a7bc93d6 - added statistical evaluation about domains that appear during crawling - added tables that show this statistics in CrawlResults web pages orbiter 2008-09-04 09:59:17 +00:00
  • 4a4f388ca5 re-design and simplification of crawl start menu layout orbiter 2008-09-04 07:56:29 +00:00
  • 4fbee21cea - added fetch-ahead again (had been removed in last commit) - reverted default query mode to verify=false orbiter 2008-09-03 23:50:13 +00:00
  • 423a89ebe8 * fix if yacy was installed to a path with whitespace * show nice dots when waiting for restart/update lotus 2008-09-03 18:49:02 +00:00
  • fc03b0437a fixed a error case where a second search after a first search with a different search word failed orbiter 2008-09-03 15:55:25 +00:00
  • eca171ba2e fix for case where javascript was not filtered by the html parser see http://forum.yacy-websuche.de/viewtopic.php?p=9667#p9667 orbiter 2008-09-03 14:41:20 +00:00
  • 992635c074 translation update daburna 2008-09-03 13:44:58 +00:00
  • e645bae29f display table in log lotus 2008-09-03 13:14:01 +00:00
  • ead39064c5 fixed problem with wrong result number calculation orbiter 2008-09-03 10:04:46 +00:00
  • 2437beb96c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1360&p=9321#p9321 hermens 2008-09-03 07:39:03 +00:00
  • 7b12e77a63 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1393&hilit=&p=9655#p9655 orbiter 2008-09-03 00:50:42 +00:00
  • 05dbba4bab added logging conditions to all fine and finest log line calls this will prevent an overhead for the generation of the log lines in case that they then are not printed orbiter 2008-09-03 00:30:21 +00:00
  • d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before) - fixed parsing of crawl-delay statements when seconds were given with float numbers - enhanced performance of profiling (not too many loggings; not more than one per second) - removed some debug output - fixed wrong return type in logging - added a logging condition in httpd to prevent that logging statements are generated when they are not written (should be added everywhere!) - fixed wrong word distance computation in RWI management orbiter 2008-09-02 23:49:48 +00:00
  • 3a0e96b552 * only create one debian package for all architectures f1ori 2008-09-02 21:53:45 +00:00
  • 3fbfd5a78b * fix for non-changing offset on new search term * dht-heap doesn't has to be deleted (5097), we simply write a new one on exit * do not install YaCy in startup because a Windows-shutdown might corrupt something. Installing YaCy as a service would solve this. lotus 2008-09-02 15:09:31 +00:00
  • 219b93df6a - fixed internal error after receiving chunked POST - removed debug output - added info for "501 Unknown" messages danielr 2008-08-29 13:51:22 +00:00
  • c245c7a45e delete index.dhtin/out.heap if restore fails see http://forum.yacy-websuche.de/viewtopic.php?p=9613#p9613 lotus 2008-08-29 13:10:41 +00:00
  • cd19d0aee6 - added warnings for failed transferRWI (dht-in) - fixed parseMultipart (uncompress gzipped body) (dht-in) - fixed parseMultipart (using content-length only if uncompressed) - better gzipped POST (chunked instead of content-length) (dht-out) danielr 2008-08-29 09:42:39 +00:00
  • 89cf795a5c proper default priority on first start (Windows) lotus 2008-08-29 07:01:38 +00:00
  • 016f57d714 fixed a dead link orbiter 2008-08-28 21:45:58 +00:00
  • df4ff423c4 added additional properties to query id's to distinguish search events better orbiter 2008-08-28 21:15:59 +00:00
  • d6d9b0f14a fixed transferRWI.html 'Read timed out' danielr 2008-08-28 08:37:51 +00:00
  • e503158527 Proxy: fix for never ending loading after POST danielr 2008-08-27 20:46:34 +00:00
  • 73519cbdca fixed pid-file for linux start-script danielr 2008-08-27 19:18:38 +00:00
  • 1a1d57e449 Proxy: added binary passthrough for POST danielr 2008-08-27 08:07:18 +00:00
  • aa6ae77e5e - autoReCrawl: fix for filter settings apfelmaennchen 2008-08-26 21:51:05 +00:00
  • 8ae29bad57 - fix to previous change of Crawl Profile Names apfelmaennchen 2008-08-26 20:42:29 +00:00
  • b8ee04daf1 fix for http://www.yacy-forum.org/viewtopic.php?f=2&t=160 (wrong url in form) f1ori 2008-08-26 18:45:19 +00:00
  • 434104e4a0 - change Crawl profile name for autoreCrawl apfelmaennchen 2008-08-26 18:08:48 +00:00
  • 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374 danielr 2008-08-26 16:34:24 +00:00
  • 0df2e47012 changed auto recrawl to comply with new date format lotus 2008-08-26 13:36:10 +00:00
  • d9d9c522a1 addendum to last commit moved recrawl times for standard profiles to constants calculate new specific dates in cleanup job lotus 2008-08-26 13:20:18 +00:00
  • 480497f7c9 changed recrawl use a specific date to define old documents this solves an unwanted recrawl-loop during a running crawl lotus 2008-08-25 20:31:32 +00:00
  • da1b0b2fc6 added two new classes that will be used for the new htcache orbiter 2008-08-25 18:22:23 +00:00
  • 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once: - removed distinction between header file types for http and ftp; ftp is simulated by using http properties - removed all old resourceInfo classes that handled this distinction - introduced a new distinction between http request and http response objects - unified new response objects with two other object types that had been introduced elsewhere - changed all servlet call methods to use the new http request header object type - divided static object keys for http header properties into request and response types - refactoring here and there (a large number of type changes and many methods merged/moved) orbiter 2008-08-25 18:11:47 +00:00
  • 04310a7255 * added long options and help-option to linux startscript * redirect all error messages to /dev/null f1ori 2008-08-25 17:17:01 +00:00
  • 08cdf6db8a fix for wrong "VegaYacyB" peers borg-0300 2008-08-24 11:30:00 +00:00
  • 296fa2265b YaCy-UI: removed unused Servlet ymarks.java apfelmaennchen 2008-08-24 08:47:37 +00:00
  • a9cf3b42c4 YaCy-UI: removed unused JavaScript apfelmaennchen 2008-08-24 08:43:07 +00:00
  • 08bf3fd235 YaCy-UI: updated to jQuery-ui 1.6b apfelmaennchen 2008-08-24 08:41:52 +00:00
  • 4d937f6b21 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1396 danielr 2008-08-22 23:46:32 +00:00
  • 5192081283 workaround for BindException at restart on Vista therefore IPv6 support is lost (Windows only) see http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6286189 and http://forum.yacy-websuche.de/viewtopic.php?t=1385 lotus 2008-08-22 18:46:14 +00:00
  • 58d7e6f1a6 - some small, rather optical changes to bookmarks apfelmaennchen 2008-08-22 18:19:21 +00:00
  • bd931a82f7 - added dynamic filters to autoReCrawl.conf - Restrict to sub-path: sub - Restrict to start-domain: dom apfelmaennchen 2008-08-22 18:05:05 +00:00
  • 8551e1d106 - added Info link (/xml/util/getpageinfo_p.xml) to bookmarks apfelmaennchen 2008-08-21 21:28:57 +00:00
  • 8d1bedfc3a - added bookmarkTitle to CrawlStart_p.html apfelmaennchen 2008-08-21 21:07:21 +00:00
  • b3fc5e96a3 - removed unused import from bookmarksDB apfelmaennchen 2008-08-20 21:26:06 +00:00
  • bc048db7b6 - bugfix for bookmarksDB's rebuildDates() - dates are now saved as String.valueOf(TimeStamp) - it might be a good idea to delete (backup) bookmarkDates.db and restart YaCy to rebuild it apfelmaennchen 2008-08-20 21:25:05 +00:00
  • 3c68905540 remove redundant null checks danielr 2008-08-20 08:37:39 +00:00
  • f9a715dc33 simplified Linux start-script danielr 2008-08-20 07:57:47 +00:00
  • 753a1ae430 - changed default browser from netscape to firefox - fixed "Inefficient use of keySet iterator instead of entrySet iterator" [WMI_WRONG_MAP_ITERATOR, FindBugs] - fixed some possible null pointer accesses danielr 2008-08-20 07:54:56 +00:00
  • 7989335ed6 Preparations to replace the HTCache with a new storage data structure: - refactoring of the HTCache (separation of cache entry) - added new storage class for BLOBs. (not used yet, this is half-way to a new structure) orbiter 2008-08-19 14:10:40 +00:00
  • c05edba6b8 * added text/html-URL to opensearchdescription, so one-click-installation in firefox is possible f1ori 2008-08-17 21:13:41 +00:00
  • be28af50f5 - fixed "yacy2yacy no proxy"-problem danielr 2008-08-17 10:16:32 +00:00
  • 6450a51473 * fixed quoting problem in initscript f1ori 2008-08-14 21:38:05 +00:00
  • f99c307eff * correct debian build dependencies * add huge mem page detection in general initscript * disable logging completely in jmimemagic-library f1ori 2008-08-14 21:01:21 +00:00
  • bdae051d9a - extended new performance graph (better timing) - added paths for new libraries in classpath for eclipse - refactoring to remove compiler warnings (static access to finals variables) - removed some unused import orbiter 2008-08-13 10:37:53 +00:00
  • d9cea5ff23 removed annotations which broke the build with java 1.5 danielr 2008-08-13 09:07:23 +00:00
  • 7d8f332bbd * build yacy for debianpackage only once * do not try to sign .changes file * add some java arguments from ./startYACY.sh to addon/yacyInit.m4 f1ori 2008-08-11 20:26:18 +00:00
  • 2547700de8 relaxed dependencies for use with debian 4.0 etch danielr 2008-08-11 15:12:49 +00:00
  • c2d49cc01e * add build target "deb" to create debian packages from svn still needs testing... f1ori 2008-08-10 15:26:13 +00:00
  • 6a550c64f1 * clean up svn ignorelist, added yacy.pid and logfiles f1ori 2008-08-10 14:56:02 +00:00
  • 59c4e35aaf corrected LargePages-detection danielr 2008-08-10 13:03:36 +00:00
  • f825e88a09 *) large memory pages will only be used if start script can confirm that Linux supports them or that the OS is Solaris, this should eliminate all sorts of weird behavior including not working shutdown low012 2008-08-10 12:02:11 +00:00
  • a087090bbb fixed starting crawl results in "No parser available to parse mimetype 'application/octet-stream'" danielr 2008-08-10 11:31:40 +00:00
  • 7e7e6a099a undo 5044 danielr 2008-08-10 10:54:13 +00:00
  • f2d0bd7790 fix for NPE in JakartaHttpClient.setProxy danielr 2008-08-10 09:37:32 +00:00
  • bb6a6fc233 fixed 'FileUploadException Stream ended unexpectedly' danielr 2008-08-09 22:44:17 +00:00
  • 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined - serverFileUtils.copy* use now Charset instead of String - added some warnings for ignored exceptions danielr 2008-08-09 12:00:31 +00:00
  • 3ac1988059 Add some sanity checks for invalid seeds hermens 2008-08-08 13:56:29 +00:00
  • cff4393f0c Fix HTCache so oldest Files get deleted first hermens 2008-08-08 08:06:06 +00:00
  • 31d97f2b9f replaced httpd.parseMultipart() by a 'right' implementation danielr 2008-08-08 01:40:28 +00:00