Commit Graph

  • 2522c26921 *) peer-search now checks matches to peername and peerhash hydrox 2007-03-24 16:51:19 +00:00
  • 5c3afb3202 added option to configure a path to a secondary index location. This shall be used to store a fragment of the index on another physical device, to split IO load and enhance access speed. The index is split in such a way that the LURLs are stored to the secondary location, and the RWIs to the primary location. This is especially useful for environments where symbolic links are not possible and may cause IO access even if there is no write access to the device which hosts the symbolic link. orbiter 2007-03-24 15:28:17 +00:00
  • 07cd30cf9b *) minor changes for last commit theli 2007-03-23 06:35:47 +00:00
  • 51b2df566b *) adding possibility to display a fileshare-dir as RSS feed (e.g. to monitor the directory for changes) usage example: http://localhost:8080/share/?format=rss theli 2007-03-23 06:25:21 +00:00
  • c2e6afbd69 *) bugfix: setting mimeType properly for dir listing with e.g. "?format=xml" theli 2007-03-23 05:37:19 +00:00
  • 242c19b480 completed TLD categorization orbiter 2007-03-22 13:52:00 +00:00
  • 146f4aee01 *) adding mimetype for opml theli 2007-03-22 12:37:14 +00:00
  • b99f9d870d *) fixed double selection of peers for the same DHT-chunk. hydrox 2007-03-22 09:08:38 +00:00
  • e6681b2a79 *) changing RSS feed titles theli 2007-03-22 08:52:58 +00:00
  • f20b596dc0 *) adding servlet to display all deployed SOAP Services - soap related servlets are located in htroot/soap *) new serverContext class for soap theli 2007-03-22 08:30:57 +00:00
  • 7edd5a0b77 *) correcting notifier.gif path theli 2007-03-22 07:42:28 +00:00
  • 8463e29b14 removes addons from minimal install allo 2007-03-21 19:50:31 +00:00
  • df15f71a5c *) avoid NullPointerExceptions if Seed is null theli 2007-03-21 17:21:28 +00:00
  • 75d90834a2 *) adding additional file extension for powerpoint theli 2007-03-21 16:18:58 +00:00
  • 2cb16824e3 removed support for old database structures. The new collection index will be more generalized to support other indexes i.e. YBR block-rank computation. A clean-up of the many conditions to support the old database was necessary. orbiter 2007-03-21 15:35:35 +00:00
  • 716b3d1533 *) updating feed item link theli 2007-03-21 15:11:45 +00:00
  • 81b4598487 *) peer profile can now be displayed as vcard e.g. http://localhost:8080/ViewProfile.vcf?hash=localhash theli 2007-03-21 15:08:18 +00:00
  • 70bd67e73e documentation update orbiter 2007-03-21 14:29:42 +00:00
  • a53403e0b2 *) Updating News Feed for new release theli 2007-03-21 14:13:13 +00:00
  • 3688ec33e5 release 0.51 orbiter 2007-03-21 14:00:17 +00:00
  • 1f61c13697 *) RSS-parser extracts the author tags now theli 2007-03-21 13:35:32 +00:00
  • 602ac42010 fix for OOM case when a kelondroTree Node cache grows See also: http://www.yacy-forum.de/viewtopic.php?p=33275#33275 orbiter 2007-03-21 13:26:18 +00:00
  • b374812f01 *) adding rpm packager as author theli 2007-03-21 13:09:12 +00:00
  • beb772d6cd fixed problem with broken notifier image, occurred only at initial start-up orbiter 2007-03-21 12:23:27 +00:00
  • 40ce33e664 *) adding RSS feed for yacy news theli 2007-03-21 12:22:18 +00:00
  • 589cbd8cbf *) replacing all yacy-news-category strings with corresponding constants Note: please use these constants from now on theli 2007-03-21 11:09:15 +00:00
  • f4af360f7c bugfix allo 2007-03-20 15:37:19 +00:00
  • bb51efbb49 "Bugfix" for Tagdisplay allo 2007-03-19 13:00:33 +00:00
  • 43114af6d7 *) Translated robots.txt-config-page *) Simplified some sentences rramthun 2007-03-19 12:42:07 +00:00
  • 7af188ff9a fix for http://www.yacy-forum.de/viewtopic.php?p=33089#33089 orbiter 2007-03-19 11:59:29 +00:00
  • 5bbf010107 removed synchronization of size() method from numerous classes to avoid thread locking orbiter 2007-03-18 19:45:23 +00:00
  • 6b9eea3932 - removed differentiation between longTitle and shortTitle; this cannot be used for search results, and it is difficult to get both types from all document types - added some author parsing orbiter 2007-03-18 12:33:19 +00:00
  • a738b57b31 added author tag to indexing content enhanced composition of title tag TODO: insert author information for external parsers orbiter 2007-03-17 01:18:34 +00:00
  • 6be57983a8 another update to the crawl balancer can now alternate between top and bottom of the crawl stack orbiter 2007-03-16 16:54:54 +00:00
  • 91cdc1493f removed query to NAT or responder in case that no other peer is there. this is not needed any more, there are enough peers orbiter 2007-03-16 15:21:24 +00:00
  • 4783a30910 - fixed a flush problem in balancer - return to idle divisor in RWI RAM cache flush orbiter 2007-03-16 15:16:26 +00:00
  • 91c2a042a7 *) bugfix for wrong proxy traffic accounting theli 2007-03-16 13:52:48 +00:00
  • 861f41e67e redesigned NURL-handling: - the general NURL-index for all crawl stack types was split into separate indexes for these stacks - the new NURL-index is managed by the crawl balancer - the crawl balancer does not need an internal index any more, it is replaced by the NURL-index - the NURL.Entry was generalized and is now a new class plasmaCrawlEntry - the new class plasmaCrawlEntry replaces also the preNURL.Entry class, and will also replace the switchboardEntry class in the future - the new class plasmaCrawlEntry is more accurate for date entries (holds milliseconds) and can contain larger 'name' entries (anchor tag names) - the EURL object was replaced by a new ZURL object, which is a container for the plasmaCrawlEntry and some tracking information - the EURL index is now filled with ZURL objects - a new index delegatedURL holds ZURL objects about plasmaCrawlEntry objects to track which url is handed over to other peers - redesigned handling of plasmaCrawlEntry - handover, because there is no need any more to convert one entry object into another - found and fixed numerous bugs in the context of crawl state handling - fixed a serious bug in kelondroCache which caused that entries could not be removed - fixed some bugs in online interface and adapted monitor output to new entry objects - adapted yacy protocol to handle new delegatedURL entries all old crawl queues will disappear after this update! orbiter 2007-03-16 13:25:56 +00:00
  • 094a1482f4 *) removing yacy.exe on ant clean theli 2007-03-16 12:44:51 +00:00
  • 832662ccd2 *) removing yacy.jar on ant clean theli 2007-03-16 12:40:43 +00:00
  • 9b5fb3908d *) a peer-message is now created when a blog-comment is written hydrox 2007-03-15 12:58:17 +00:00
  • 581db87237 more debug code for http://www.yacy-forum.de/viewtopic.php?p=33009#33009 orbiter 2007-03-14 15:04:06 +00:00
  • 81c4cc6bf7 better debugging of balancer failure orbiter 2007-03-14 12:02:56 +00:00
  • dd06d4cada more logging to better trace bug http://www.yacy-forum.de/viewtopic.php?p=33001#33001 orbiter 2007-03-14 09:36:54 +00:00
  • 96b79bf86d redesigned remove method in kelondroRowSet This should fix also numerous bugs like http://www.yacy-forum.de/viewtopic.php?p=31077#31077 (java.lang.ArrayIndexOutOfBoundsException in kelondroRowCollection.removeShift) orbiter 2007-03-14 08:55:05 +00:00
  • 9f929b5438 better snippet handling in case of snippet load fail see also http://www.yacy-forum.de/viewtopic.php?p=31096#31096 orbiter 2007-03-13 22:18:36 +00:00
  • d451ad48d3 *) improved peerloadgraphic: - unnecessary (0 %) pieces are removed - percent-values of each thread displayed in legend auron_x 2007-03-12 19:08:17 +00:00
  • a5d668c0c6 added speed-buttons for easy performance setting appears in crawl start and on indexing monitor page orbiter 2007-03-12 16:24:28 +00:00
  • 5b0a84ce09 fix for synchronization deadlock with flushMissNameCache. see also: http://www.yacy-forum.de/viewtopic.php?p=32939#32939 orbiter 2007-03-12 09:06:57 +00:00
  • e2ac5f62bd - Make code prettier [from NN's TODO] karlchenofhell 2007-03-11 19:53:14 +00:00
  • f04097c3dd integrated tor-patch for crawling, if yacyDebugMode is set. (replaces: http://yacy.deruwe.de/overlay/net-misc/yacy-tor/files/disable_dns_checks-svn3132.patch) allo 2007-03-11 18:43:11 +00:00
  • 22fe14f292 *) first version of Peerload-graphic auron_x 2007-03-11 17:04:11 +00:00
  • 432d7d4e9c better catch orbiter 2007-03-10 23:38:08 +00:00
  • 8f7e8b6ee2 auto-delete for not-fixable db error in crawl stacker. see also http://www.yacy-forum.de/viewtopic.php?p=32906#32906 orbiter 2007-03-10 23:31:36 +00:00
  • 7a52b07fcc better memory protection during freemem cycle see also http://www.yacy-forum.de/viewtopic.php?p=32903#32903 orbiter 2007-03-10 23:22:37 +00:00
  • 6faa262259 fix for NURL-fix orbiter 2007-03-09 14:30:53 +00:00
  • 909d7a8ae9 fixed wrongly implemented row iterator in kelondroFlexSplitTables this has no effect, until now this iterator was only used on the Index Administration page. orbiter 2007-03-09 13:55:26 +00:00
  • a1fb8358b2 let's make a well-formed http link so that other crawlers don't have a problem to follow this link :-) orbiter 2007-03-09 12:35:54 +00:00
  • 4edb70f68b added yacybot info-page from Roland orbiter 2007-03-09 12:26:31 +00:00
  • 3ef77d2030 fix for http://www.yacy-forum.de/viewtopic.php?p=29878#29878 orbiter 2007-03-09 12:14:25 +00:00
  • 3bb3df3fc0 fix for http://www.yacy-forum.de/viewtopic.php?p=32298#32298 orbiter 2007-03-09 12:03:53 +00:00
  • b3ca177a5d fix for http://www.yacy-forum.de/viewtopic.php?p=32797#32797 orbiter 2007-03-09 11:49:56 +00:00
  • 243a2f831b fixed problem with not found NURL-hashes The cause for this problem could still not be found, but the effect is handled much better. The NURL-pop will continue automatically until it found a hash that can be found. orbiter 2007-03-09 11:07:20 +00:00
  • 6ad39bae1e fixed shutdown problem this fixes the 'inconsistency' messages during start-up orbiter 2007-03-09 08:48:47 +00:00
  • 38b93f8cb8 bugfix for my last commit: iterator did not consider secondary start point in case of rotation orbiter 2007-03-08 22:07:17 +00:00
  • 264a82eec8 - fix for http://www.yacy-forum.de/viewtopic.php?t=3657 - fix for http://www.yacy-forum.de/viewtopic.php?p=32758#32758 - Diff takes any objects now, not only strings karlchenofhell 2007-03-08 22:04:15 +00:00
  • 045d758537 Avoid stopwords as topwords, configurable rramthun 2007-03-08 20:50:27 +00:00
  • d755a8026d - better OOM protection - better memory allocation for FlexTable indexes - splitting between static index and dynamic index (only the dynamic part must grow) - to enable a merge-iteration of the newly split index, a huge number of classes needed to be adapted for new iterator classes - added new iterator classes that support cloneable iterators - adapted all iterator classes to implement cloneable iterators orbiter 2007-03-08 16:15:40 +00:00
  • 2be405e1e1 - fix for last two commits karlchenofhell 2007-03-08 14:00:04 +00:00
  • de1b4a1731 - don't publish news if empty or equal page is submitted in wiki karlchenofhell 2007-03-08 13:50:24 +00:00
  • dcc13abd59 - fixed small bug at home page, button "peer's console" - fixed <fieldset><dl> for safari on many pages - added Blog-link to Network page karlchenofhell 2007-03-08 13:39:09 +00:00
  • 6596167277 *) bugfix for wrong RSS feed pubDate formats theli 2007-03-08 08:37:47 +00:00
  • 0d178d00a5 *) adding RSS feed for peer messages theli 2007-03-08 08:10:36 +00:00
  • 23338d2070 small fix for RAM computation orbiter 2007-03-07 23:55:52 +00:00
  • 33f97cff7a changed startup initialization sequence slightly orbiter 2007-03-07 23:24:16 +00:00
  • 4f2e6ef47b - WatchCrawler_p shows max. 80 characters of URLs now (maybe dynamically adjustable based on browser width?) - typo in BlacklistCleaner karlchenofhell 2007-03-07 23:16:25 +00:00
  • 70cd391ea1 fix for dl/fieldset problem in Safari orbiter 2007-03-07 22:49:32 +00:00
  • 5741701b59 moved crawl start up, personal web pages down in main menu orbiter 2007-03-07 16:08:13 +00:00
  • b627c77df6 - workaround for safari bug with definition lists inside fieldsets in ConfigBasic - alternative can be seen in PerformanceMemory, where a dl is simulated with a table layout orbiter 2007-03-07 15:53:04 +00:00
  • 4e8eb1dbe3 some minor changes here and there orbiter 2007-03-07 14:22:10 +00:00
  • 03c5906ae7 - minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646 - PerformanceMemory_p.html is valid XHTML again karlchenofhell 2007-03-07 11:50:03 +00:00
  • 3499a364ef a little bit better memory protection orbiter 2007-03-07 09:38:14 +00:00
  • 313f6a7680 fix for http://www.yacy-forum.de/viewtopic.php?p=31553#31553 orbiter 2007-03-07 09:26:01 +00:00
  • 958ebea5c5 fix for http://www.yacy-forum.de/viewtopic.php?p=32470#32470 orbiter 2007-03-07 09:08:13 +00:00
  • 5d5e6ebfcc fix for http://www.yacy-forum.de/viewtopic.php?p=32631#32631 orbiter 2007-03-07 08:54:07 +00:00
  • 8e9bee12fc *) adding guid to yacysearch.rss theli 2007-03-07 05:58:14 +00:00
  • 1cba31de43 redesigned ram organization for database caches - each cache can now allocate as much memory as is available - no more fixed limits - replaced old performance memory monitor by new one - added supervision methods as static functions into the classes that provide cache functionality - steering of ram allocation is done with two simple limits that are ram availability-relative orbiter 2007-03-06 22:43:32 +00:00
  • e934c5b09b *) wrong blog rss feed title theli 2007-03-06 17:37:21 +00:00
  • ceed0364e2 *) Blog RSS: Image added *) RSS Feed for YaCy Bookmarks added theli 2007-03-06 17:35:24 +00:00
  • 26450a1d9a *) avoid NullPointerException on seed.getAddress() (reported by netbude) theli 2007-03-06 16:11:36 +00:00
  • fc43007490 added .homeip.net borg-0300 2007-03-05 19:22:29 +00:00
  • db235f2d61 added some memory protection in collection index multiple merge orbiter 2007-03-04 22:54:04 +00:00
  • c72605ecab *) adding a function to determine if a given URL is bookmarked theli 2007-03-03 11:57:49 +00:00
  • bd03c6b874 *) bugfix in bookmarksDB: - NullPointerException when trying to get an unknown bookmark - bookmarks can either start with http or https theli 2007-03-03 11:56:46 +00:00
  • b466baa574 added some memory protection: too large collection arrays are now avoided. By default, the biggest collection index is 7. Larger collections are dumped into a commons directory, but cannot yet be used. Before doing a dump, the collection is split into a part which has only root-references, and stored back to the collection; the remaining part goes to commons orbiter 2007-03-03 00:55:51 +00:00
  • ce360ef43e *) no more HTML in plasmaCrawlProfile.java *) <br> will not be displayed in items in Auto Filter Content on WatchCrawler_p.html anymore *) removed unnecessary replaceHTML() low012 2007-03-02 21:09:28 +00:00
  • 93e1ad2bca - fix for last commit karlchenofhell 2007-03-02 01:50:21 +00:00
  • 88245e44d8 - improved version of robots.txt (delete your old htroot/robots.txt before updating): - robots.txt is a servlet now - no need to rewrite the whole file each time a section is added or removed - user-defined disallows, added manually, won't be overwritten anymore - new config-setting: httpd.robots.txt, holding names of the disallowed sections karlchenofhell 2007-03-02 01:19:38 +00:00
  • 9623bf7bbe - removed call of java 1.5 method - added config servlet for local robots.txt - removed YPStats_p as it is of no use anymore - supertemplates use XHTML now - quick-fix for http://www.yacy-forum.de/viewtopic.php?p=32296#32296 karlchenofhell 2007-03-01 13:54:14 +00:00
  • f4c13b422c *updated translation daburna 2007-03-01 09:36:59 +00:00