Commit Graph

  • a2fa75e688 *) Asynchronous queuing of crawl job URLs (stackCrawl) various checks like the blacklist check or the robots.txt disallow check are now done by a separate thread to unburden the indexer thread(s) TODO: maybe we have to introduce a threadpool here if it turn out that this single thread is a bottleneck because of the time consuming robots.txt downloads theli 2005-10-05 10:45:33 +00:00
  • 01db66dc69 implemented image-servlets. the imagetest will stay there only for a limited time. Now images can be generated on-the-fly from servlets orbiter 2005-10-05 08:40:20 +00:00
  • f6a0e0f162 small bugfix to readFully orbiter 2005-10-05 07:21:26 +00:00
  • 6d5d0ac801 bugfix for startup problems orbiter 2005-10-05 00:52:55 +00:00
  • a9c466ef21 cleaned, finals, StringBuffer, Properties borg-0300 2005-10-04 17:51:32 +00:00
  • 37ee53856c cut for release 0.41 orbiter 2005-10-04 11:20:20 +00:00
  • 5dc0d41900 bugfix in kelondroRA (hint by Martin) orbiter 2005-10-04 10:24:09 +00:00
  • 0c3a20d44f more + changed log for better understanding of outOfMemory bug and others orbiter 2005-10-04 00:28:59 +00:00
  • e5f8163203 fixed a bug with news; news moving could lead to shurtcut loop / 100% CPU; appeared when clicked on a 'Profile' news in Network menu orbiter 2005-10-03 23:33:10 +00:00
  • d8544e8416 cosmetic borg-0300 2005-10-03 23:13:41 +00:00
  • fbb5e36b80 documentation update orbiter 2005-10-03 23:04:11 +00:00
  • cd77078aa0 old Version restored before Release allo 2005-10-03 18:10:05 +00:00
  • a4b747fe97 ProxyAccounts based on userDB allo 2005-10-03 14:26:08 +00:00
  • 0cb9566546 HTML for last Commit. allo 2005-10-03 12:23:20 +00:00
  • 8d871a6a79 delete Function allo 2005-10-03 12:17:12 +00:00
  • 7df4c85506 bugfix & cleanup allo 2005-10-03 11:51:12 +00:00
  • 0fd9aa6c6e *) Bugfix: supportedFileExt Function didn't detect the file extension correctly because of missing conversion to lower case theli 2005-10-03 10:48:41 +00:00
  • 8a33c9b309 *) Bugfix: supportedFileExt Function didn't detect the file extension correctly if there was a dot in one of the parent directories of the file. theli 2005-10-03 10:21:13 +00:00
  • 28c5687ff9 *) Bugfix for "download of non supported file content" via crawler See: http://www.yacy-forum.de/viewtopic.php?p=10724#10724 theli 2005-10-03 08:45:39 +00:00
  • 2b3f964037 *) Bugfix: supportedFileExt Function didn't chop http parameters before trying to detect the file extension theli 2005-10-03 08:42:55 +00:00
  • f58cfbf9a8 passive red to yellow changed borg-0300 2005-10-01 14:24:59 +00:00
  • 252492d031 update User allo 2005-10-01 13:34:13 +00:00
  • 1b61b674db changed method borg-0300 2005-10-01 11:43:48 +00:00
  • a9e3fefaec Properties borg-0300 2005-09-30 16:30:16 +00:00
  • 2d481e89df Michael would like it borg-0300 2005-09-30 16:06:42 +00:00
  • 60fc32ffeb load values allo 2005-09-30 16:06:20 +00:00
  • d388292f24 *) adding function for user accounting which is called after each http request theli 2005-09-30 16:02:58 +00:00
  • c7b7c69484 *) adding some functions to userDB.Entry theli 2005-09-30 15:41:01 +00:00
  • 8de3b2dd3c User create function allo 2005-09-30 15:25:45 +00:00
  • d9c08a47f9 *) Minor changes to userDB theli 2005-09-30 15:02:17 +00:00
  • af7829df25 *) adding some functions to userDB.Entry theli 2005-09-30 14:56:50 +00:00
  • ff1d3d0680 Init of userDB Pagelayout of User_p.html allo 2005-09-30 13:48:26 +00:00
  • 8d728d742b remove my SVN 818 change borg-0300 2005-09-30 13:46:25 +00:00
  • 71c1dc06ca Properties borg-0300 2005-09-30 13:24:27 +00:00
  • 19648702cc *) adding iterator to userDB theli 2005-09-30 13:21:31 +00:00
  • a3bb20ca88 New column Info replaced Type, Contact and accept Crawl/Index; 11 new pictures show the information in 11x32 pixel; borg-0300 2005-09-30 13:03:36 +00:00
  • dd59c1ace9 *) Adding first version of yacy user DB theli 2005-09-30 10:28:49 +00:00
  • 1d1b8f699e *) Extending proxy autoconfig to avoid problems with multiple local network cards theli 2005-09-30 07:47:28 +00:00
  • 595e0c7e56 *) Bugfix for ProxyErrormsg: Wrong base URL See: http://www.yacy-forum.de/viewtopic.php?p=9905#9905 theli 2005-09-30 06:15:22 +00:00
  • 394fb5d484 *) Small changes that make entering values much easier for people who use the TAB-key to navigate through the page or who use textbowsers like lynx. low012 2005-09-29 20:17:10 +00:00
  • 9e1485c13b new Class for UserAccounts allo 2005-09-29 20:15:31 +00:00
  • 138d218d5b all big number have now separators (not Last Seen); used dot instead of comma; used yacySeed.constants; borg-0300 2005-09-29 13:10:59 +00:00
  • 9c4306e41e fixed problem with htcache path orbiter 2005-09-29 00:24:09 +00:00
  • 25a59a51ad fixed problem created with last svn commit orbiter 2005-09-28 23:16:00 +00:00
  • 0054d3b1a6 added age in network menu orbiter 2005-09-28 22:54:31 +00:00
  • 3dd7e90cdd kbytes instead of bytes in performance settings; new default values orbiter 2005-09-28 18:53:41 +00:00
  • 1669eaaa1a fixed svn 805 orbiter 2005-09-28 14:47:57 +00:00
  • ca82d690a9 changed in SVN 805 one line too much borg-0300 2005-09-28 13:58:42 +00:00
  • 4bb1f849a0 Bugfix for http://www.yacy-forum.de/viewtopic.php?t=1233 borg-0300 2005-09-28 13:49:57 +00:00
  • 2c7b490e30 memory-logging orbiter 2005-09-28 00:52:54 +00:00
  • ee347364d9 basic xhtml/css layout -to test, replace #[he4ader]# with #[xhtml-header]#, #[footer]# with #[xhtml-footer]# and use the xhtml-style css allo 2005-09-27 20:13:34 +00:00
  • 717f7678a4 small changes to german translation rramthun 2005-09-27 19:02:08 +00:00
  • 7fc822a59b changed handling of time-zones orbiter 2005-09-27 16:28:55 +00:00
  • 4c2c5ace01 Adding random seedlist to superseed. will upload php source later, when it is finished :-D rramthun 2005-09-27 12:02:49 +00:00
  • 97990299fa *) Changing lib names after migration to newer versions of PDFBox + jsch Thanks to Hydrox for the advice theli 2005-09-27 07:52:20 +00:00
  • 9b7f37fc37 *) Minor changes - more debugging output: storageTime for indexed document is logged now - saving memory in plasmaParserDocument.java, plasmaWordIndexEntryContainer.java (not a big deal) theli 2005-09-27 07:10:24 +00:00
  • 3c1d968d29 fix-fix for 792 and small changes in ftpc/download/dir experiments orbiter 2005-09-26 10:36:42 +00:00
  • b5a8992d29 *) Setting some object fields to final theli 2005-09-26 09:39:54 +00:00
  • 5f95a1cf62 *) Bugfix for ProxyErrormsg: Wrong http host header See: http://www.yacy-forum.de/viewtopic.php?p=9905#9905 theli 2005-09-26 08:10:40 +00:00
  • 023be89586 *) Bugfix for "Robots.txt wird immer wieder geladen" See: http://www.yacy-forum.de/viewtopic.php?p=10241#10233 theli 2005-09-26 08:05:59 +00:00
  • 35c6c5ead7 *) Bugfix for "Blacklist und Crawlen" Bug. : Crawling continues even if URL is listed in Blacklist See: http://www.yacy-forum.de/viewtopic.php?p=10279#10279 - missing return statement added. Thanks to allo for the code review. theli 2005-09-26 06:51:11 +00:00
  • dc474aa22f various bug-fixes orbiter 2005-09-26 01:10:41 +00:00
  • 9e2fc7e5fe load balancing of crawl target domains orbiter 2005-09-25 01:09:21 +00:00
  • 1aa79f5bb5 cleaned; Properties; (no author) 2005-09-24 11:58:17 +00:00
  • 694226d4b1 typo orbiter 2005-09-24 00:40:52 +00:00
  • 3fcc95a82c integrated crawl-profiles db in memory-performance monitor orbiter 2005-09-24 00:33:27 +00:00
  • 72ce36baba cleanup in kelondroRecords orbiter 2005-09-24 00:14:01 +00:00
  • 0625a4f81b changed test on wget orbiter 2005-09-23 23:35:46 +00:00
  • fe6a6abc0b *) Adding robots.txt db to Performance Settings for Memory menue theli 2005-09-23 01:31:29 +00:00
  • 3274ae725e increased cache size of robots database; however, this should be integrated into new memory control orbiter 2005-09-23 00:37:31 +00:00
  • c6d2f50375 changed order of robots and double-check orbiter 2005-09-23 00:18:08 +00:00
  • 68d5ff2ef1 added stringbuffer in condenser orbiter 2005-09-22 23:43:45 +00:00
  • c42a543bc3 *) Adding peername to logmessage when receiving URLs theli 2005-09-22 23:40:03 +00:00
  • 90f02ea455 *) removing metainfo from serverargs theli 2005-09-22 23:05:34 +00:00
  • 70a5681a4f *) Bugfix for inactive scp seed uploader theli 2005-09-22 23:04:26 +00:00
  • e380d4e55e cleanup (no functional change) orbiter 2005-09-22 21:51:21 +00:00
  • 4aa04972ac bugfix allo 2005-09-22 21:02:08 +00:00
  • de0a58d79c no more sync orbiter 2005-09-22 20:31:55 +00:00
  • 7991c05b49 homePath instead if RootPath allo 2005-09-22 20:25:56 +00:00
  • 495bc8bec6 removed cache-control from low and medium priority caches which reduces memory use and computation overhead orbiter 2005-09-22 20:01:26 +00:00
  • 5bf7d74114 permanent yacy.logging see http://www.yacy-forum.de/viewtopic.php?p=10020 allo 2005-09-22 18:54:36 +00:00
  • 1dc94e7753 *) Adding support for gzip content-encoding of http post requests used to transferRWIs and transferURLs. See: http://www.yacy-forum.de/viewtopic.php?t=1167#10020 theli 2005-09-22 10:30:55 +00:00
  • e6b9b23290 configuration of startup-memory in webinterface orbiter 2005-09-22 01:17:49 +00:00
  • a1f5027a88 finals; cleaned; Properties; borg-0300 2005-09-21 23:18:12 +00:00
  • 979a3ee3c0 exceptions for better testing of bug http://www.yacy-forum.de/viewtopic.php?p=9852#9852 orbiter 2005-09-21 22:58:15 +00:00
  • 18d9e1a256 fix for http://www.yacy-forum.de/viewtopic.php?p=10026#10026 orbiter 2005-09-21 21:56:39 +00:00
  • 1dd7047af5 finals; cleaned; Properties; borg-0300 2005-09-21 21:32:43 +00:00
  • ba145efb46 check for wget allo 2005-09-21 20:42:57 +00:00
  • 211a72ec6b using wget instead of java for shutdown -better performance -more native allo 2005-09-21 20:08:40 +00:00
  • ef85fce661 change of memory-consumption constants (had been much too low) orbiter 2005-09-21 15:32:49 +00:00
  • 07f30931ec various configuration options in memory performance orbiter 2005-09-21 14:21:45 +00:00
  • 2f732e32a2 enhancements to memory menue orbiter 2005-09-21 12:21:01 +00:00
  • 150bd33591 finals; cleaned; Properties; borg-0300 2005-09-21 10:48:46 +00:00
  • e17df64b54 removed IS_ADMIN - feature. This was covered by plasmaSwitchborad.adminAuthenticated orbiter 2005-09-21 09:22:01 +00:00
  • b5337a122c some more information about available memory in PerformaceMemory menu orbiter 2005-09-21 00:12:37 +00:00
  • 96a5b6e8fb removed yacy peer types from serverSwitch orbiter 2005-09-20 23:15:33 +00:00
  • b990dc1ad1 *) Replacing jsch 0.1.19 lib with newer version 0.1.21 *) Replacing PDFBox 0.7.1 lib with newer version 0.7.2 *) Refactoring of classes httpd/httpc/httpHeaders to make many methods for httpHeader/Requestline parsing reusable for new icap implementation *) adding chunked input stream support - needed by new icap implementation - needed by future httpc HTTP/1.1 support *) httpd.java - moving all connection property contants to class httpHeader - moving readHeader function to class httpHeader - moving parseQuery function to class httpHeader - moving handleTransparentProxy function to class httpHeader *) httpHeader.java - adding new fuction to parse the http response line - adding new function to converte http headers to a string that can be send to the client - adding a function that generates a proper url using all parsed connection properties *) ICAP Support - yacy now supports handling of icap response modification requests - this feature can be used by other icap enabled proxies to contact yacy as icap server, and to handover the downloaded content to yacy.logging for indexing - functionality was successfully tested with squid 2.5Stable 10 + icap patch - further icap services e.g. URL filtering based on yacy's blacklists are possible *) plasmaSwitchboard.java - htcache entries that are still needed for indexing are now properly registered as in use after system restart - extended logging: log message now shows parsing and indexing time for each sb. entry theli 2005-09-20 21:49:47 +00:00
  • 6d1de8abfd finals; cleaned; Properties; borg-0300 2005-09-20 15:43:31 +00:00
  • e748ba3f6e super(), finals; other; cleaned; Properties; borg-0300 2005-09-20 15:36:22 +00:00
  • af9021e956 fixed bug with news caching orbiter 2005-09-20 11:30:03 +00:00