Commit Graph

  • 6e7340ef52 added exclusion search (you can now search and exclude words from the result with '-') orbiter 2007-04-03 15:35:29 +00:00
  • e4734a8b6b fix for fix in SVN 3537 orbiter 2007-04-03 14:42:29 +00:00
  • c1dd6a674a erased stopwords. We need a different solution here. Stopwords must consider experiences from our new common words and it must distinguish different languages. orbiter 2007-04-03 12:43:40 +00:00
  • 356033aceb fixed bug with continuous reset of balancer file index orbiter 2007-04-03 12:36:24 +00:00
  • ba2c307ab3 optimized memory allocation in kelondroRow.Entry such an entry cannot be instantiated without allocation of new byte[]; instead it can re-use memory from other kelondroRow.Entry objects. during bugfixing also other bugs may have been solved, maybe the INCONSISTENCY problem could have been solved. One cause can be missing synchronization during bulk storage when a R/W-path optimization is done. To test this case, the optimization is currently switched off. More memory enhancements can be done after this initial change to the allocation scheme. orbiter 2007-04-03 12:10:12 +00:00
  • 24ea4ca631 *) adding first version of postscript parser theli 2007-04-01 15:02:07 +00:00
  • def0d6124e *) trying to solve SecurityManager problem during init of soap engine theli 2007-03-31 16:01:52 +00:00
  • 75eb65028a *) adding a test if a seucrity manager is active theli 2007-03-31 14:30:09 +00:00
  • 210ede8230 added a class for byte-array management. This was the result of a very large experiment to replace byte[] objects within kelondro. Frequent System.arraycopy are common when kelondroRow.Entry objects are handled. This class may be used to prevent this. However, experimental replacement of byte[] by kelondroByteArray in kelondroRow.Entry resulted in complete re-write of large parts of kelondro. This experiment did not completely lead to a result, because then the interface to kelondro had to be changed also from byte[] to kelondroByteArray, which may have caused a rewrite of large parts of YaCy. The experiment is therefore abanonded, but this class remains here without any function but possibly for future use. orbiter 2007-03-30 08:44:43 +00:00
  • bc37ac64b3 *) Fix for last commit. low012 2007-03-28 22:37:01 +00:00
  • f603b58f6c *) No stacktrace anymore if invalid regex is entered for URL mask or Prefer mask, insted an error message gets displayed. low012 2007-03-28 22:18:54 +00:00
  • 1b7fda12ee *) SOAP: separate function to get the active/passive/potential peer list theli 2007-03-28 07:34:44 +00:00
  • 6488ec8a80 no deletions in index in case that snippet-loading fails and there is no network connection orbiter 2007-03-27 08:21:45 +00:00
  • 847349358b less memory usage during collectionIndex-rebuild should also speed up that process a little bit orbiter 2007-03-27 08:21:03 +00:00
  • 8ef3ad12a7 *) fix for rare bug in PPM-calc auron_x 2007-03-25 21:46:03 +00:00
  • 00bc0c1b47 *) new logging for PPM-Calculation auron_x 2007-03-25 20:24:12 +00:00
  • 5941577076 *) added some logging to PPM-Calculation to find a rare bug auron_x 2007-03-25 14:56:42 +00:00
  • 2522c26921 *) peer-search now checks matches to peername and peerhash hydrox 2007-03-24 16:51:19 +00:00
  • 5c3afb3202 added option to configure a path to a secondary index location. this shall be used to store a fragment of the index on another physical device, to split IO load and enhance access speed. The index is splitted in such a way that the LURLs are stored to the secondary location, and the RWIs to the primary location. This is especially useful for environments where symbolic links are not possible and may cause IO access even if there is no write access to the device which hosts the symbolic link. orbiter 2007-03-24 15:28:17 +00:00
  • 07cd30cf9b *) minor changes for last commit theli 2007-03-23 06:35:47 +00:00
  • 51b2df566b *) adding possibility to display a fileshare-dir as RSS feed (e.g. to monitor the directory for changes) usage example: http://localhost:8080/share/?format=rss theli 2007-03-23 06:25:21 +00:00
  • c2e6afbd69 *) bugfix: setting mimeType properly for dir listing with e.g. "?format=xml" theli 2007-03-23 05:37:19 +00:00
  • 242c19b480 completed TLD categorization orbiter 2007-03-22 13:52:00 +00:00
  • 146f4aee01 *) adding mimetype for opml theli 2007-03-22 12:37:14 +00:00
  • b99f9d870d *) fixed double selection of peers for the same DHT-chunk. hydrox 2007-03-22 09:08:38 +00:00
  • e6681b2a79 *) changing RSS feed titles theli 2007-03-22 08:52:58 +00:00
  • f20b596dc0 *) adding servlet to display all deployed SOAP Services - soap related servlets are located in htroot/soap *) new serverContext class for soap theli 2007-03-22 08:30:57 +00:00
  • 7edd5a0b77 *) correcting notifier.gif path theli 2007-03-22 07:42:28 +00:00
  • 8463e29b14 removes addons from minimal install allo 2007-03-21 19:50:31 +00:00
  • df15f71a5c *) avoid NullpointerExceptions if Seed is null theli 2007-03-21 17:21:28 +00:00
  • 75d90834a2 *) adding additional file extension for powerpoint theli 2007-03-21 16:18:58 +00:00
  • 2cb16824e3 removed support for old database structures. The new collection index will be more generalized to support other indexes i.e. YBR block-rank computation. A clean-up of the many conditions to support the old database was necessary. orbiter 2007-03-21 15:35:35 +00:00
  • 716b3d1533 *) updating feed item link theli 2007-03-21 15:11:45 +00:00
  • 81b4598487 *) peer profile can now be displayed as vcard e.g. http://localhost:8080/ViewProfile.vcf?hash=localhash theli 2007-03-21 15:08:18 +00:00
  • 70bd67e73e documentation update orbiter 2007-03-21 14:29:42 +00:00
  • a53403e0b2 *) Updating News Feed for new release theli 2007-03-21 14:13:13 +00:00
  • 3688ec33e5 release 0.51 orbiter 2007-03-21 14:00:17 +00:00
  • 1f61c13697 *) RSS-parser extracts the author tags now theli 2007-03-21 13:35:32 +00:00
  • 602ac42010 fix for OOM case when a kelondroTree Node cache grows See also: http://www.yacy-forum.de/viewtopic.php?p=33275#33275 orbiter 2007-03-21 13:26:18 +00:00
  • b374812f01 *) adding rpm packager as author theli 2007-03-21 13:09:12 +00:00
  • beb772d6cd fixed problem with broken notifier image, occurred only at initial start-up orbiter 2007-03-21 12:23:27 +00:00
  • 40ce33e664 *) adding RSS feed for yacy news theli 2007-03-21 12:22:18 +00:00
  • 589cbd8cbf *) replacing all yacy-news-category strings with corresponding constants Note: please use these constants from now on theli 2007-03-21 11:09:15 +00:00
  • f4af360f7c bugfix allo 2007-03-20 15:37:19 +00:00
  • bb51efbb49 "Bugfix" for Tagdisplay allo 2007-03-19 13:00:33 +00:00
  • 43114af6d7 *) Translated robots.txt-config-page *) Simplified some sentences rramthun 2007-03-19 12:42:07 +00:00
  • 7af188ff9a fix for http://www.yacy-forum.de/viewtopic.php?p=33089#33089 orbiter 2007-03-19 11:59:29 +00:00
  • 5bbf010107 removed synchronization of size() method from numerous classes to avoid thread locking orbiter 2007-03-18 19:45:23 +00:00
  • 6b9eea3932 - removed differentiation between longTitle and shortTitle; this cannot be used for search results, and it is difficult to get both types from all document types - added some author parsing orbiter 2007-03-18 12:33:19 +00:00
  • a738b57b31 added author tag to indexing content enhanced composition of title tag TODO: insert author information for external parsers orbiter 2007-03-17 01:18:34 +00:00
  • 6be57983a8 another update to the crawl balancer can now alternate between top and bottom of the crawl stack orbiter 2007-03-16 16:54:54 +00:00
  • 91cdc1493f removed query to NAT or responder in case that no other peer is there. this is not needed any more, there are enough peers orbiter 2007-03-16 15:21:24 +00:00
  • 4783a30910 - fixed a flush problem in balancer - return to idle divisor in RWI RAM cache flush orbiter 2007-03-16 15:16:26 +00:00
  • 91c2a042a7 *) bugfix for wrong proxy traffic accounting theli 2007-03-16 13:52:48 +00:00
  • 861f41e67e redesigned NURL-handling: - the general NURL-index for all crawl stack types was splitted into separate indexes for these stacks - the new NURL-index is managed by the crawl balancer - the crawl balancer does not need an internal index any more, it is replaced by the NURL-index - the NURL.Entry was generalized and is now a new class plasmaCrawlEntry - the new class plasmaCrawlEntry replaces also the preNURL.Entry class, and will also replace the switchboardEntry class in the future - the new class plasmaCrawlEntry is more accurate for date entries (holds milliseconds) and can contain larger 'name' entries (anchor tag names) - the EURL object was replaced by a new ZURL object, which is a container for the plasmaCrawlEntry and some tracking information - the EURL index is now filled with ZURL objects - a new index delegatedURL holds ZURL objects about plasmaCrawlEntry obects to track which url is handed over to other peers - redesigned handling of plasmaCrawlEntry - handover, because there is no need any more to convert one entry object into another - found and fixed numerous bugs in the context of crawl state handling - fixed a serious bug in kelondroCache which caused that entries could not be removed - fixed some bugs in online interface and adopted monitor output to new entry objects - adopted yacy protocol to handle new delegatedURL entries all old crawl queues will disappear after this update! orbiter 2007-03-16 13:25:56 +00:00
  • 094a1482f4 *) removing yacy.exe on ant clean theli 2007-03-16 12:44:51 +00:00
  • 832662ccd2 *) removing yacy.jar on ant clean theli 2007-03-16 12:40:43 +00:00
  • 9b5fb3908d *) a peer-message are now created when a blog-comment is written hydrox 2007-03-15 12:58:17 +00:00
  • 581db87237 more debug code for http://www.yacy-forum.de/viewtopic.php?p=33009#33009 orbiter 2007-03-14 15:04:06 +00:00
  • 81c4cc6bf7 better debugging of balancer failure orbiter 2007-03-14 12:02:56 +00:00
  • dd06d4cada more logging to better trace bug http://www.yacy-forum.de/viewtopic.php?p=33001#33001 orbiter 2007-03-14 09:36:54 +00:00
  • 96b79bf86d redesigned remove method in kelondroRowSet This should fix also numerous bugs like http://www.yacy-forum.de/viewtopic.php?p=31077#31077 (java.lang.ArrayIndexOutOfBoundsException in kelondroRowCollection.removeShift) orbiter 2007-03-14 08:55:05 +00:00
  • 9f929b5438 better snippet handling in case of snippet load fail see also http://www.yacy-forum.de/viewtopic.php?p=31096#31096 orbiter 2007-03-13 22:18:36 +00:00
  • d451ad48d3 *) improved peerloadgraphic: - unnecessary (0 %) pieces are removed - percent-values of each thread displayed in legend auron_x 2007-03-12 19:08:17 +00:00
  • a5d668c0c6 added speed-buttons for easy performance setting appears in crawl start and on indexing monitor page orbiter 2007-03-12 16:24:28 +00:00
  • 5b0a84ce09 fix for synchronization deadlock with flushMissNameCache. see also: http://www.yacy-forum.de/viewtopic.php?p=32939#32939 orbiter 2007-03-12 09:06:57 +00:00
  • e2ac5f62bd - Code hübscher machen [von NNs TODO] karlchenofhell 2007-03-11 19:53:14 +00:00
  • f04097c3dd integrated tor-patch for crawling, if yacyDebugMode is set. (replaces: http://yacy.deruwe.de/overlay/net-misc/yacy-tor/files/disable_dns_checks-svn3132.patch) allo 2007-03-11 18:43:11 +00:00
  • 22fe14f292 *) first version of Peerload-graphic auron_x 2007-03-11 17:04:11 +00:00
  • 432d7d4e9c better catch orbiter 2007-03-10 23:38:08 +00:00
  • 8f7e8b6ee2 auto-delete for not-fixable db error in crawl stacker. see also http://www.yacy-forum.de/viewtopic.php?p=32906#32906 orbiter 2007-03-10 23:31:36 +00:00
  • 7a52b07fcc better memory protection during freemen cycle see also http://www.yacy-forum.de/viewtopic.php?p=32903#32903 orbiter 2007-03-10 23:22:37 +00:00
  • 6faa262259 fix for NURL-fix orbiter 2007-03-09 14:30:53 +00:00
  • 909d7a8ae9 fixed wrong implemented row iterator in kelomdroFlexSplitTables this has no effect, until now this iterator was only used on the Index Administration page. orbiter 2007-03-09 13:55:26 +00:00
  • a1fb8358b2 lets make a well-formed http link so that other crawlers don't have a problem to follow this link :-) orbiter 2007-03-09 12:35:54 +00:00
  • 4edb70f68b added yacybot info-page from Roland orbiter 2007-03-09 12:26:31 +00:00
  • 3ef77d2030 fix for http://www.yacy-forum.de/viewtopic.php?p=29878#29878 orbiter 2007-03-09 12:14:25 +00:00
  • 3bb3df3fc0 fix for http://www.yacy-forum.de/viewtopic.php?p=32298#32298 orbiter 2007-03-09 12:03:53 +00:00
  • b3ca177a5d fix for http://www.yacy-forum.de/viewtopic.php?p=32797#32797 orbiter 2007-03-09 11:49:56 +00:00
  • 243a2f831b fixed problem with not found NURL-hashes The cause for this problem could still not be found, but the effect is handled much better. The NURL-pop will continue automatically until it found a hash that can be found. orbiter 2007-03-09 11:07:20 +00:00
  • 6ad39bae1e fixed shutdown problem this fixes the 'inconsistency' messages during start-up orbiter 2007-03-09 08:48:47 +00:00
  • 38b93f8cb8 bugfix for my last commit: iterator did not consider secondary start point in case of rotation orbiter 2007-03-08 22:07:17 +00:00
  • 264a82eec8 - fix for http://www.yacy-forum.de/viewtopic.php?t=3657 - fix for http://www.yacy-forum.de/viewtopic.php?p=32758#32758 - Diff takes any objects now, not only strings karlchenofhell 2007-03-08 22:04:15 +00:00
  • 045d758537 Avoid stopwords as topwords, configurable rramthun 2007-03-08 20:50:27 +00:00
  • d755a8026d - better OOM protection - better memory allocation for FlexTable indexes - splitting between static index and dynamic index (only the dynamic part must grow) - to enable a merge-iteration of new splittet index, a huge number of classes needed to be adopted for new iterator classes - added new iterator classes that support cloneable iterators - adopted all iterator classes to implement cloneable itarators orbiter 2007-03-08 16:15:40 +00:00
  • 2be405e1e1 - fix for last two commits karlchenofhell 2007-03-08 14:00:04 +00:00
  • de1b4a1731 - don't publish news if empty or equal page is submitted in wiki karlchenofhell 2007-03-08 13:50:24 +00:00
  • dcc13abd59 - fixed small bug at home page, button "peer's console" - fixed <fieldset><dl> for safari on many pages - added Blog-link to Network page karlchenofhell 2007-03-08 13:39:09 +00:00
  • 6596167277 *) bugfix for wrong RSS feed pubDate formats theli 2007-03-08 08:37:47 +00:00
  • 0d178d00a5 *) adding RSS feed for peer messages theli 2007-03-08 08:10:36 +00:00
  • 23338d2070 small fix for RAM computation orbiter 2007-03-07 23:55:52 +00:00
  • 33f97cff7a changed startup initialization sequence slightly orbiter 2007-03-07 23:24:16 +00:00
  • 4f2e6ef47b - WatchCrawler_p shows max. 80 characters of URLs now (maybe dynamically adjustable based on browser width?) - typo in BlacklistCleaner karlchenofhell 2007-03-07 23:16:25 +00:00
  • 70cd391ea1 fix for dl/fieldset problem in Safari orbiter 2007-03-07 22:49:32 +00:00
  • 5741701b59 moved crawl start up, personal web pages down in main menu orbiter 2007-03-07 16:08:13 +00:00
  • b627c77df6 - workaround for safari bug with definition lists inside fieldsets in ConfigBasic - alternative can be seen in PerformanceMemory, where a dl is simulated with a table layout orbiter 2007-03-07 15:53:04 +00:00
  • 4e8eb1dbe3 some minor changes here and there orbiter 2007-03-07 14:22:10 +00:00
  • 03c5906ae7 - minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646 - PerformanceMemory_p.html is valid XHTML again karlchenofhell 2007-03-07 11:50:03 +00:00
  • 3499a364ef a little bit better memory protection orbiter 2007-03-07 09:38:14 +00:00
  • 313f6a7680 fix for http://www.yacy-forum.de/viewtopic.php?p=31553#31553 orbiter 2007-03-07 09:26:01 +00:00