Commit Graph

  • 958ebea5c5 fix for http://www.yacy-forum.de/viewtopic.php?p=32470#32470 orbiter 2007-03-07 09:08:13 +00:00
  • 5d5e6ebfcc fix for http://www.yacy-forum.de/viewtopic.php?p=32631#32631 orbiter 2007-03-07 08:54:07 +00:00
  • 8e9bee12fc *) adding guid to yacysearch.rss theli 2007-03-07 05:58:14 +00:00
  • 1cba31de43 redesigned ram organization for database caches - each cache can now allocate as much memory as is available - no more fixed limits - replaced old performance memory monitor by new one - added supervision methods as static functions into the classes that provide cache functionality - steering of ram allocation is done with two simple limits that are ram availability-relative orbiter 2007-03-06 22:43:32 +00:00
  • e934c5b09b *) wrong blog rss feed titel theli 2007-03-06 17:37:21 +00:00
  • ceed0364e2 *) Blog RSS: Image added *) RSS Feed for YaCy Bookmarks added theli 2007-03-06 17:35:24 +00:00
  • 26450a1d9a *) avoid nullpointerException on seed.getAddress() (reported by netbude) theli 2007-03-06 16:11:36 +00:00
  • fc43007490 added .homeip.net borg-0300 2007-03-05 19:22:29 +00:00
  • db235f2d61 added some memory protection in collection index multiple merge orbiter 2007-03-04 22:54:04 +00:00
  • c72605ecab *) adding a function to determine if a given URL is bookmarkt theli 2007-03-03 11:57:49 +00:00
  • bd03c6b874 *) bugfix in bookmarksDB: - NullpointerException when trying to get an unknown bookmark - bookmarks can either start with http or https theli 2007-03-03 11:56:46 +00:00
  • b466baa574 added some memory protection too large collection arrays are now avoided. By default, the biggest collection index is 7. larger collections are dumped into a commons directory, but cannot yet be used. Bevore doing a dump, the collection is splittet into a part which has only root-references, and stored back to the collection; the remaining part goes to commons orbiter 2007-03-03 00:55:51 +00:00
  • ce360ef43e *) no more HTML in plasmaCrawlProfile.java anymore *) <br> will not be displayed in items in Auto Filter Content on WatchCrawler_p.html anymore *) removed unnecessary replaceHTML() low012 2007-03-02 21:09:28 +00:00
  • 93e1ad2bca - fix for last commit karlchenofhell 2007-03-02 01:50:21 +00:00
  • 88245e44d8 - improved version of robots.txt (delete your old htroot/robots.txt before updating): - robots.txt is a servlet now - no need to rewrite the whole file each time a section is added or removed - user-defined disallows, added manually, won't be overwritten anymore - new config-setting: httpd.robots.txt, holding names of the disallowed sections karlchenofhell 2007-03-02 01:19:38 +00:00
  • 9623bf7bbe - removed call of java 1.5 method - added config servlet for local robots.txt - removed YPStats_p as it is of no use anymore - supertemplates use XHTML now - quick-fix for http://www.yacy-forum.de/viewtopic.php?p=32296#32296 karlchenofhell 2007-03-01 13:54:14 +00:00
  • f4c13b422c *updated translation daburna 2007-03-01 09:36:59 +00:00
  • 9b33562ed1 *) adding mimetype application/x-rar theli 2007-02-28 13:11:59 +00:00
  • 51e12049fa third generation of R/W head path optimization - data from collection arrays are read in order - merged data is written in order orbiter 2007-02-28 11:13:23 +00:00
  • 1fe505f0b0 - adapted User_p to general web-interface style (and removed status-only page on changes) - beautified WikiHelp.html + typos - IP hasn't been set correctly in Blog.xml karlchenofhell 2007-02-28 09:22:31 +00:00
  • 92b6bc0ad2 - fixed wrongly applied replacement of "<" and ">" in Blog and simplified the code a bit - added check, whether active blacklist engine is supported by blacklist cleaner karlchenofhell 2007-02-28 00:04:32 +00:00
  • a1d68fe092 - use .class rather than Class.forName for classes in class-path - added Bost's patch for Diff.findDiagonale() from: http://www.yacy-forum.de//files/patch_685.txt - fixed minor bugs in Blog karlchenofhell 2007-02-27 22:52:22 +00:00
  • 10a3c20b8d some more enhancements to R/W Head path optimization orbiter 2007-02-27 15:54:02 +00:00
  • f4cfd19835 second Generation of collection R/W head path optimization: - permanent cache flush is switched off. The optimized cache flush works better if it is a large number of collections that is flushed together - the flush size can be configured instead the flush divisor. There is only one size for all flushes - collection records that shall be removed during collection transition (jump from one collection file to another) are now not really removed but only marked in RAM. add-operations to the collection use these marked collection spaces - index bulk write operations are now separated for each file of a kelondroFlex orbiter 2007-02-27 13:01:22 +00:00
  • e92e8b2ae3 *) added RSS-Feed for blog hydrox 2007-02-27 10:05:46 +00:00
  • a107961099 *) fixed blog-comment-deletion without admin-rights is no longer possible *) fixed no empty blog-comments anymore hydrox 2007-02-27 08:37:47 +00:00
  • ea2dbcb034 *updated translation for -blacklistcleaner -blogcomments -header.template *small changes daburna 2007-02-27 07:52:14 +00:00
  • cf47075855 CSS corrects (no author) 2007-02-26 23:03:10 +00:00
  • 1fda50fd3c correct R/W head positioning in kelondroFlex and some enhancements orbiter 2007-02-26 22:25:39 +00:00
  • 116fc016d0 *) fix for Blogcomment-Preview hydrox 2007-02-26 16:18:43 +00:00
  • 304412a049 first generation of collection index R/W head path optimization - collections are now hand-over as collection lists to collection index for merge opertations - collection index lists are separated into 'new' and 'extend' lists - lists are written separately - write operations are done into array sets and array indexes. These are now serialized - write operations into index files are sorted by index; that means that a R/W head does not need to go forward and backward, only forward More enhancements are possible orbiter 2007-02-26 15:49:23 +00:00
  • 54fef3574f *) missing files for last commit hydrox 2007-02-26 14:38:34 +00:00
  • cb89c74d52 *) added blog-comments *) removed debug-output when deleting news hydrox 2007-02-26 14:36:01 +00:00
  • 6fbe31425a - some code-cleanup (no more syntax-warnings here) - added deletion from loadedURLs of URLs to be blacklisted in IndexControl_p karlchenofhell 2007-02-26 12:56:50 +00:00
  • 32867580ee update to kelondroRecords needed fo last commit orbiter 2007-02-26 11:55:36 +00:00
  • e3480d4ad3 fix for warning in crawl balancer orbiter 2007-02-26 11:54:43 +00:00
  • ed021a3f70 bugfix, see http://www.yacy-forum.de/viewtopic.php?t=3573 daburna 2007-02-26 10:05:28 +00:00
  • 31ad42535a - added buttons to add complete domain or single URL to blacklist to IndexControl_p karlchenofhell 2007-02-25 23:14:45 +00:00
  • 8668ac5d91 preparations for collection index cache flush optimization (hand-over commit, no functional change to current code) orbiter 2007-02-25 21:06:26 +00:00
  • 42e9747650 fixed /path/forwarding. uncomment, if you want to use it. allo 2007-02-25 20:07:35 +00:00
  • e0decf4653 - added support for changing invalid entries in blacklist cleaner karlchenofhell 2007-02-25 19:36:05 +00:00
  • c58ef48e1c - increased size of subject text-field karlchenofhell 2007-02-25 18:48:25 +00:00
  • 1d31ebbeec - added experimental PHP script which redirects from a vhost to a peer, using a public seed-file karlchenofhell 2007-02-25 15:18:22 +00:00
  • 9cbf94222f *) added seedurl to network.xml as requested by lulabad auron_x 2007-02-25 10:24:42 +00:00
  • 39a2000d8b - added support for [[Bookmark:$bookmarkTag|description]]-link-listings (requested by theli) to wiki-parser - added support for <pre>-tags to wiki-parser karlchenofhell 2007-02-24 21:26:48 +00:00
  • 619653c054 - fix for last commit karlchenofhell 2007-02-24 15:40:56 +00:00
  • 26f5757b40 - added support for multiple paths per domain to default-blacklist warning: an interface-change had been neccessary: - remove(String, String) has been renamed to removeAll(String, String), because it removes all path-entries for the specified host - remove(String, String, String) has been added to delete only a path-entry - geBlacklistType(String) has been renamed to getBlacklistType(String) karlchenofhell 2007-02-24 13:56:32 +00:00
  • 3d6ab19f7e - remove double entries in blacklist as well karlchenofhell 2007-02-23 18:27:56 +00:00
  • bf7a69197d - fix for possible NPE in queues_p - WatchCrawler_p: - display crawler traffic - pause/resume local- and global crawler karlchenofhell 2007-02-22 22:26:11 +00:00
  • 9702d3abba further supertemplate test allo 2007-02-22 11:01:16 +00:00
  • a5a36d9252 - hopefully last fix fo 1.5 methods (sorry for that, eclipse isn't that helpful in identifying those methods) karlchenofhell 2007-02-22 08:04:09 +00:00
  • e97b6f0458 - we still use Java 1.4 ... karlchenofhell 2007-02-21 22:43:31 +00:00
  • 0c7b8cf632 - added first version of new wiki-parser - added blacklist support to manual URLFetcher stack fill - fix for NPE: http://www.yacy-forum.de/viewtopic.php?t=3559 karlchenofhell 2007-02-21 22:31:36 +00:00
  • f7803a6ce4 enhanced crawl balancer - new domains now get a chance to get crawled early - less IO operations - new balancing method - better dump order at shutdown time - bugfixes regarding not found url hashes (no more superfluous cache kill) - domain access time is now shared over all balancer stacks - viewing the stack does no more disturbish the balancing algorithm that much - intelligent selection of best next domain using domain access times - extra double-check (to double-check the double-check) orbiter 2007-02-21 16:23:31 +00:00
  • 801eea8849 *) Fixed bug where pairReplace() got caught in infinite recursion. (http://www.yacy-forum.de/viewtopic.php?t=3466) low012 2007-02-20 22:07:59 +00:00
  • c8862e47fb *) adding mimetype for svg theli 2007-02-20 15:25:46 +00:00
  • 39b0658839 Redesign of Webinterface menu structure orbiter 2007-02-20 14:35:29 +00:00
  • c3e8c23f5d fix for 'CANNOT FETCH ENTRY: hash is null' bug orbiter 2007-02-20 13:53:21 +00:00
  • badab8d924 fixed some more bugs in new db handling orbiter 2007-02-20 12:29:12 +00:00
  • e72d253577 fixed problem with initial cache load orbiter 2007-02-20 11:20:48 +00:00
  • 2d8e472cfd emergeny bugfix for last commit (kelondroTree should work again) the cache prefill is broken and will be fixed later orbiter 2007-02-20 10:25:17 +00:00
  • 868aaabf88 documentation update orbiter 2007-02-20 08:37:09 +00:00
  • dc0c06e43d PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS redesign for better IO performance enhanced database seek-time by avoiding write operations at distant positions of a database file. until now, a USEDC counter was written at the head-section of a kelondroRecords database file (which is the basic data structure of all kelondro database files) to store the actual number of records that are contained in the database. Now, this value is computed from the database file size. This is either done only once at start-time, or continuously when run in asserts enabled. The counter is then updated only in RAM, and written at close of the file. If the close fails, the correct number can be computed from the file size, and if this is not equal to the stored number it is a strong evidence that YaCY was not shut down properly. To preserve consistency, the complete storage-routine had to be re-written. Another change enhances read of nodes in some cases, where the data-tail can be read together with the data-head. This saves another IO lookup during each DB node fetch. Includes also many small bugfixes. IF ANYTHING GOES WRONG, ALL YOUR DATA IS LOST: PLEASE MAKE A BACK-UP orbiter 2007-02-20 08:35:51 +00:00
  • 5af76fccd7 *) peer-search on Network.html now is case-insensitive hydrox 2007-02-19 13:00:41 +00:00
  • c016fcb10f - added streaming-support to CrawlURLFetchStack_p servlet - bug for NPE in list.java - use more constants karlchenofhell 2007-02-19 12:47:46 +00:00
  • 65af9d3215 - continue shifting even in the case the stacked URL could not be found karlchenofhell 2007-02-18 01:50:41 +00:00
  • fdd1180ac5 Adding two icon files, both containing different sizes from 16x16 to 128x128 pixels in one file. The .icns is for Macintosh rramthun 2007-02-17 22:21:11 +00:00
  • d114a0136e - crawl profile: don't add null-values - added some settings and statistics for url-fetcher 'server'-mode - added own stack for fetchable URLs - added possibility to fill stack via shift from peer's queues, via POST (addurls=$count and url$num=$url) or via file-upload - added "htroot" to classpath of linux start-script karlchenofhell 2007-02-17 19:16:53 +00:00
  • a46dc43f45 - added lock symbol for restart- and stutdown-buttons on Status-page (see http://www.yacy-forum.de/viewtopic.php?p=31444#31444) karlchenofhell 2007-02-16 00:25:45 +00:00
  • b2a9d32f29 why do I always forget some lines? sorry... karlchenofhell 2007-02-14 15:11:03 +00:00
  • e6ddf135bb - enabled fetching new crawls via /yacy/list.html?list=queueUrls for testing purposes - sent URLs are taken off the limit-stack (of the global crawl trigger) (may be moved somewhere else in future versions) - added option to set the requested chunk-size karlchenofhell 2007-02-14 14:50:55 +00:00
  • 67d96249b4 - fix for last commit karlchenofhell 2007-02-13 21:17:43 +00:00
  • c5a2ba3a23 - prepared URL fetch from other peers - more feedback for user karlchenofhell 2007-02-13 20:18:12 +00:00
  • 661a7bb702 *updated translation for -network -wiki daburna 2007-02-12 20:53:47 +00:00
  • 5ba531a722 *) higher precision for QPH also on status-page auron_x 2007-02-11 09:33:39 +00:00
  • 4e5eda6ef9 huch... karlchenofhell 2007-02-10 20:25:45 +00:00
  • 50b59e312f - added experimental CrawlURLFetch_p-Servlet to fetch new URLs from a specified location (\n-seperated list). Requested by Theli. karlchenofhell 2007-02-10 20:20:00 +00:00
  • 6c6375577e - fix for http://www.yacy-forum.de/viewtopic.php?t=3523 karlchenofhell 2007-02-10 11:50:18 +00:00
  • ea20d8d7c5 - return to edited wiki-page after submit karlchenofhell 2007-02-09 19:47:21 +00:00
  • e1edb23689 *) Bugfix for IllegalMonitorStateException See: http://www.yacy-forum.de/viewtopic.php?t=3522 theli 2007-02-09 19:32:49 +00:00
  • bf69a721cb more protection against mis-use of YaCyHop interface: - target must not be at port 80 - target access not more than every 3 seconds - requester may not access more than every 10 seconds orbiter 2007-02-09 15:25:10 +00:00
  • a15963ff98 better balancing: if element from top would force a busy waiting, an element from the bottom of the stack is used instead. orbiter 2007-02-09 10:32:58 +00:00
  • dda24fcb85 ups orbiter 2007-02-09 09:55:21 +00:00
  • 8c1d2e0227 protection against crawl balancer failure: a minimum of 500 milliseconds distance between two acesses to the same domain is now ensured orbiter 2007-02-09 09:48:23 +00:00
  • 1f1f398bfa enhanced speed of RAM cache flush by factor 20 (twenty times faster) - the speed was doubled by avoiding read access during the dump - the speed was dramatically increased at least by factor 10 by using a temporary ram-file where the structures are flushed to before it is dumped then as a whole byte-chunk to the file system. The speed enhancements also affects some other parts of the database. orbiter 2007-02-08 23:21:46 +00:00
  • 30d79d69a6 fix for wrong display of search statistics see http://www.yacy-forum.de/viewtopic.php?p=31242#31242 orbiter 2007-02-08 10:42:35 +00:00
  • ac376662cc *) changing alternate link to relative link *) fix for wrong date theli 2007-02-08 06:19:45 +00:00
  • b4981187c5 *) adding alternate link to rss theli 2007-02-08 06:16:52 +00:00
  • 24e3dd4734 *) first version of yacy changelog RSS Feed See: http://www.yacy-forum.de/viewtopic.php?t=3462 theli 2007-02-08 06:13:30 +00:00
  • daf2e15f59 some storage process enhancements (write without preceding read) orbiter 2007-02-07 23:23:24 +00:00
  • faad869865 *) added peer-search to Network.html hydrox 2007-02-07 11:20:31 +00:00
  • 9c2101a852 small enhancement to cache dump orbiter 2007-02-07 00:02:54 +00:00
  • c464157a6e replaced some toString() see http://www.yacy-forum.de/viewtopic.php?p=31151#31151 orbiter 2007-02-06 16:26:56 +00:00
  • 7673f0869b minor enhancements orbiter 2007-02-06 16:01:03 +00:00
  • b4aa195c27 added user-agent check for yacy-hop proxy authentication orbiter 2007-02-06 09:53:02 +00:00
  • 2d7f7da7ce fix for null pointer exception orbiter 2007-02-06 09:50:24 +00:00
  • 6256d89883 *) bugfix for reg.exp to determine svn rev. nr theli 2007-02-06 07:47:56 +00:00
  • d25caa07bf redesigned some parts of http authentication added another access check for peer hops orbiter 2007-02-05 19:46:50 +00:00
  • 588e48ce0b *) Part II of last commit. Note to myself: check svn commandline syntax :-( low012 2007-02-05 18:21:11 +00:00
  • 0d2431d6f7 *) removed printed out '<br />' in row Hit-Size Miss-Size by moving <br /> from Java file to HTML file. low012 2007-02-05 18:16:01 +00:00