Commit Graph

  • 22ed9c380c *) fixed bug which was introduced in r7226 (shame on me) which made wiki unusable (all entries were stored with empty subject as key -> edits were lost) low012 2010-10-18 21:06:23 +00:00
  • 60fd2e549d * log failures when writing config file f1ori 2010-10-18 15:00:29 +00:00
  • 58e74282af added a word counter statistic in condenser which is used by the did-you-mean to calculate best matches for given search words. orbiter 2010-10-18 11:35:09 +00:00
  • 2a0eb09e08 enhanced html id names and tag cloud visualization orbiter 2010-10-18 09:36:49 +00:00
  • 863065abc4 added user agent logging to access tracker orbiter 2010-10-18 08:09:59 +00:00
  • 61e87c0b14 IndexControlRWIs_p.html, IndexControlURLs_p.html, ViewFile.html/.java: changes to HTML output and &nbsp; in case of empty values for XHTML strict / transitional validation de.lng: Added missing translation for Show Content and changed existing line --> Index Administration should now correctly validate XHTML 1.0 Strict / Trans mikeworks 2010-10-17 16:51:29 +00:00
  • a79728b97d some updates to experimental bookmark code... apfelmaennchen 2010-10-17 09:58:50 +00:00
  • ef782cd026 and even more experimental bookmark code... apfelmaennchen 2010-10-16 10:20:41 +00:00
  • ed4371dcf3 enhanced navigation implementation and enhanced tag cloud computation orbiter 2010-10-15 23:45:12 +00:00
  • ca738ac924 - added a tag cloud to search results (using the topics) - some refactoring of score classes - added default package for new classes add_ymark and delete_ymark orbiter 2010-10-15 22:01:39 +00:00
  • 7aca763ca8 Some more experimental bookmark code... apfelmaennchen 2010-10-15 12:53:41 +00:00
  • 4270ed696c Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html. apfelmaennchen 2010-10-15 05:40:19 +00:00
  • e4d561971e added more score cluster options and made score cluster usage more transparent orbiter 2010-10-14 11:40:02 +00:00
  • e8f90201a5 fix for scheduling of rss feeds orbiter 2010-10-13 13:00:36 +00:00
  • 7cd9d9d22a - enhanced DidYouMean computation using a faster count on index entries; this allows results to be ranked better - added limitations on DidYouMean result sets according to input and output string length orbiter 2010-10-12 22:02:10 +00:00
  • beb65437d2 additional fix for the widget - now a second result page is loaded automatically in case of too little search results for the scroll event to trigger apfelmaennchen 2010-10-12 21:55:24 +00:00
  • 2bb0c9b503 Fix for search widget keyup event handling. ESC will close the widget window and RIGHT will load additional search results, especially when the scroll event won't work because of too little results. apfelmaennchen 2010-10-12 21:28:28 +00:00
  • de722090b5 enhancements in did-you-mean guessing orbiter 2010-10-12 09:45:15 +00:00
  • a59c885ee0 autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types orbiter 2010-10-12 08:36:33 +00:00
  • b7acd92ce4 Auto-Suggestions for YaCy Search: - added a suggest servlet according to opensearch and firefox standard - integrated the suggest servlet into opensearch description file - integrated an autocomplete plugin for jquery - added an autocomplete addition to the yacy search windows showing autosuggest queries orbiter 2010-10-12 01:23:49 +00:00
  • 67b3b4b13b de.lng: Added translation for skin color picker on http://demo.zuum.net:8080/ConfigAppearance_p.html ConfigAppearance_p.html: Some HTML 1.0 Strict changes on the Customization page http://demo.zuum.net:8080/ConfigAppearance_p.html --> Now all Customization pages should validate XHTML 1.0 strict mikeworks 2010-10-12 01:03:10 +00:00
  • 24f1cba7b2 performance hacks: - faster generation of index abstract compression during remote search - less synchronization in IO record reading - request index abstract generation only if necessary and faster time-out in remote search orbiter 2010-10-11 12:44:07 +00:00
  • 6a166c2040 patches for bad proxy behaviour - accept ipv6 localhost clients - index media files (url only) orbiter 2010-10-11 11:38:36 +00:00
  • d607b30b6a performance enhancements for search and code review for database functions - removed read cache from Records data structure because the read cache had no cache hit during search operation - copied old read-cache class to CachedRecords and the old, now new Records class does not have the cache any more and a code review checked that data structures and synchronization is clean - removed unnecessary synchronization from Table class during get() orbiter 2010-10-11 11:01:50 +00:00
  • 6d61b80fb6 added ColorPicker to WatchWebStructure orbiter 2010-10-11 08:57:40 +00:00
  • 45b1ab3d07 custom + generic skins: - added a generic skin which is filled with actual color assignment using a servlet - enabled css servlets - added a generic color scheme in configuration file - added configuration input in Customization/Appearance servlet - added a jquery color picker widget - placed color picked widget to input field of generic colour definition input fields orbiter 2010-10-11 00:00:10 +00:00
  • 41bf8ef9f9 de.lng: Added translation for Link List option in Crawl Site page http://localhost:8080/CrawlStartSite_p.html mikeworks 2010-10-10 10:44:15 +00:00
  • fcd40cd30f - disabled domZones (buggy, must think about better solution) - increased time-out for dns resolver and isLocal property orbiter 2010-10-09 10:17:50 +00:00
  • ec38eca278 fix for new URI equal method orbiter 2010-10-09 09:27:31 +00:00
  • 0d363a94d7 more performance hacks this makes YaCy search results VERY fast for all verify=false search cases and it enhances the search speed also for all other snippet-fetch cases. With this change my peer performed 100 Queries Per Second (!!!) while doing 10 queries simultaneously (!!!) in an intranet index of 20000 URLs on my 16-core Mac orbiter 2010-10-09 08:55:57 +00:00
  • b8aee6d402 performance hacks for better search performance orbiter 2010-10-08 23:50:28 +00:00
  • 11bebe356b fixed crawl start: with SVN 7225 the name of the crawl start url was not given in input field and therefore all crawl starts had contained the empty string as crawl start url orbiter 2010-10-08 22:02:24 +00:00
  • 2971c91988 fix for http://forum.yacy-websuche.de/viewtopic.php?p=20977#p20977 orbiter 2010-10-08 21:44:28 +00:00
  • 091dd3f6ec - enhanced intranet search speed - enhanced intranet portscan speed (better time-out) orbiter 2010-10-08 10:54:13 +00:00
  • b9f405d1e8 *) added comments *) more beautiful and easier to understand code (IMO) *) added display= parameter to a lot of links in Wiki.html low012 2010-10-08 00:32:50 +00:00
  • 70576e88d2 de.lng: Added some more untranslated strings I found and uncommented old ones that were removed terminal_p.html: Put back the old ID which was really easy to find IndexCreate.js: Because XHTML 1.0 Strict does not allow name tags for some elements rewrote most element access functions to use getElementById Table_API_p.html and all other html pages: Some XHTML 1.0 Strict fixes, changed checkAll javascript, marked the first row with checkboxes as unsortable where applicable Table_API_p.java and all other java pages: URLencoded lines with possible ampersands & -> &amp; for validation XHTML 1.0 Strict sourcecode --> All Index Create pages should validate now. Hope I did not break anything else (too much :-) mikeworks 2010-10-06 00:00:23 +00:00
  • efa59250f8 release 0.98 for SuMa-eV Demo tomorrow orbiter 2010-10-05 17:52:54 +00:00
  • 6e6994e328 latest bugfixes to search and indexing function after test of demo presentation orbiter 2010-10-05 17:49:53 +00:00
  • c3bf17a3a1 fixed must-match filter for smb crawling orbiter 2010-10-05 00:05:08 +00:00
  • 099def2a04 small changes in search widget appearance orbiter 2010-10-04 23:43:33 +00:00
  • 50586a0dfd rename of widget to 'widget' orbiter 2010-10-04 19:57:29 +00:00
  • dffa142529 Fix for author navigator in yacyui-portalsearch.js apfelmaennchen 2010-10-04 19:27:12 +00:00
  • 574346f8ce better must-match pattern for intranet file-crawls orbiter 2010-10-04 12:55:39 +00:00
  • aacf572a26 - enhancements for search speed - bug fixes in many classes including basic data structure classes orbiter 2010-10-04 11:54:48 +00:00
  • aa6075402a small fix for crawling from 'sitelist' at changes from 7214 sixcooler 2010-10-01 22:41:28 +00:00
  • 61c82f3105 gzip-compression @ transferRWI & transferURL back again This reduces upload-volume to suit limited bandwidth of home-users like me :-) sixcooler 2010-10-01 00:42:43 +00:00
  • 2c549ae341 fixed a number of small bugs: - better crawl start for file paths and smb paths - added time-out wrapper for dns resolving and reverse resolving to prevent blockings - fixed intranet scanner result list check boxes - prevented htcache usage in case of file and smb crawling (not necessary, documents are locally available) - fixed rss feed loader - fixes sitemap loader which had not been restricted to single files (crawl-depth must be zero) - clearing of crawl result lists when a network switch was done - higher maximum file size for crawler orbiter 2010-09-30 23:57:58 +00:00
  • f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option - nobody understands the auto-dom filter without a lengthy introduction about the function of a crawler - nobody ever used the auto-dom filter other than with a crawl depth of 1 - the auto-dom filter was buggy since the filter did not survive a restart and then a search index contained waste - the function of the auto-dom filter was in fact to just load a link list from the given start url and then start separate crawls for all these urls restricted by their domain - the new Site Link-List option shows the target urls in real-time during input of the start url (like the robots check) and gives a transparent feed-back what it does before it can be used - the new option also fits into the easy site-crawl start menu orbiter 2010-09-30 12:50:34 +00:00
  • 63e387508c ConfigLanguage_p.java: Fixed the filename for the API call to ConfigLanguage_p.html - previously ConfigLanguage.html was recorded and the action could not be replayed with error 404 - Not found mikeworks 2010-09-30 03:16:17 +00:00
  • f468d377d7 Collage.html and Collage.java: Added <p> in body before images for XHTML 1.0 Transitional validation and alt tag to images as well as closing tag <img (...) /> terminal_p.html: Set new link for starting a crawl to CrawlStartSite_p.html and replaced the old embed object of the Among.us Flash object by their new JS which takes care of adding the object correctly de.lng: Moved the translations for the JS part from yacyinteractive.html to the yacyinteractive.js part --> Terminal page is now valid XHTML 1.0 Transitional mikeworks 2010-09-29 23:11:29 +00:00
  • 3057a0b939 - intranet scanner now produces urls with host names, not ips if possible - CrawlStartIntranet servlet shows IPs and host names orbiter 2010-09-29 22:44:49 +00:00
  • 75964909aa added missing path to htroot (may only be necessary for cross-linking of servlet classes) orbiter 2010-09-29 22:19:41 +00:00
  • 421aa6a8bb ConfigLiveSearch.html: Fixed some HTML problems to validate at least XHTML 1.0 Transitional - strict is not possible because iframes are used. Replacing iframes with embedded object tag does not work in IE ConfigPortal.html: Fixed some HTML problems to validate at least XHTML 1.0 Transitional - for strict the target attribute of the a link has to be removed yacyinteractive.html: Moved all JS code to an external yacyinteractive.js file in JS folder yacysearch.html: Removed embedded scripts from in between the body tags - now everything is loaded in the header de.lng: Just in case JS files will be parsed at some point added translation for yacyinteractive.html result counter mikeworks 2010-09-29 20:06:39 +00:00
  • c60aed4435 no caching in browser of dynamic web pages sent by YaCy http this may prevent unnecessary IO caused by cache storage of the browser orbiter 2010-09-29 19:56:42 +00:00
  • 41a93ff565 de.lng: Small changes to the German translation and added missing part in Surftips.html and YaCy statistics link in left menu in header.template mikeworks 2010-09-29 06:08:19 +00:00
  • b7bb0cabaf Blacklist_p.html: Minor HTML and Javascript changes to get XHTML 1.0 Strict validation, lowercase onchange, id tags instead of name tags mikeworks 2010-09-29 05:41:32 +00:00
  • cd505d7e30 de.lng: German translation of the new Intranet Servlet introduced in SVN 7203 in CrawlStartIntranet_p.html CrawlStartIntranet_p.html: New Intranet Crawl Start Servlet - minor HTML changes to get XHTML 1.0 Strict validation, remove (double) name tags, remove single ending </dt> mikeworks 2010-09-28 22:05:12 +00:00
  • e63896f2a8 added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once. orbiter 2010-09-28 12:18:54 +00:00
  • e54cb7fb0c more bugfixes (also for latest commit) orbiter 2010-09-28 10:20:46 +00:00
  • be6b48311c misc bugfixes orbiter 2010-09-28 10:00:33 +00:00
  • 44874f2cb9 Added encoding="UTF-8" in the RSS files suessthomas 2010-09-27 20:36:49 +00:00
  • d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed - migrated the 'yacy' user agent to 'yacybot' in many client methods since the 'yacy' user agent is only used for the proxy orbiter 2010-09-27 14:54:32 +00:00
  • fc3ee9d8a1 changed paths for new libs also in Info.plist when changing paths in build.xml, please also update the paths in /addon/YaCy.app/Contents/Info.plist orbiter 2010-09-27 14:49:51 +00:00
  • f4357dff03 bump to httpclient-4.0.3 which fixes a number of bugs sixcooler 2010-09-27 13:24:40 +00:00
  • ad7efe6016 rssTerminal.html: Fixing the 'null' is null or not an object in rss2.js when viewing the YaCy default Status page http://localhost:8080/Status.html with Internet Explorer feed.xml: copy of feed.rss that helps Internet Explorer also read the Feed - workaround for the fix above Problem is described in the forums and should be fixed better ;-(http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2766&p=20702) mikeworks 2010-09-26 22:55:52 +00:00
  • 190de644dd de.lng: Added German translations for some missing table content on Network view WatchWebStructure_p.html: Added JS verification of RGB color codes (currently only RGB value is checked but this could be enhanced to also check for websafe colors) mikeworks 2010-09-26 22:28:32 +00:00
  • d5dc88a351 show cleanup button only if servlet was called without post/put arguments. This should avoid confusion after a search for a word where it is possible to delete the word. If a delete button is shown to delete the word, then there should not be a button available to delete the whole index to avoid a wrong usage when a user searches only for a word to delete it. orbiter 2010-09-26 21:11:08 +00:00
  • afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent *) code changes (PMD) without functional changes low012 2010-09-26 12:57:07 +00:00
  • a83186ac7d fix for bug in cytrails orbiter 2010-09-26 10:32:40 +00:00
  • b019426811 de.lng: Added German translations for new Index Creation pages RSS Feeds and adapted text in Tables_p.html and CrawlStartExpert_p.html to match some typos, also changed one name tag to id to conform with XHTML 1.0 Strict mikeworks 2010-09-26 01:39:51 +00:00
  • 48c0d508ac fixes for crawling of smb links (file length not always available) orbiter 2010-09-25 22:32:26 +00:00
  • 965a40b623 de.lng: Added some German translation for Admin Console -> Advanced Settings and started to add some translations for new Index Creation pages mikeworks 2010-09-25 21:53:13 +00:00
  • 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts - added missing entry for new icu4j path in Mac App orbiter 2010-09-25 21:10:50 +00:00
  • 6295cac35a set reasonable mime-type for icu4j-core.jar f1ori 2010-09-25 20:04:34 +00:00
  • e670e1ef8e add charset auto-detection for htmlParser f1ori 2010-09-25 18:02:21 +00:00
  • ddcd5ae78c fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989 f1ori 2010-09-25 15:07:25 +00:00
  • 8fe1102452 fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426 reuse code from htmlParser f1ori 2010-09-24 13:14:20 +00:00
  • daeea96aea renamed servlet CrawlStart_p.html to CrawlStartSite_p.html to circumvent problem with translation which still showed old expert crawl start page orbiter 2010-09-22 21:46:31 +00:00
  • 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes also enhances selection criteria for best snippet line computation orbiter 2010-09-22 20:50:02 +00:00
  • 4450c240b7 npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982 lotus 2010-09-22 20:24:07 +00:00
  • 84a023cbc8 fixed several search bugs orbiter 2010-09-21 21:48:42 +00:00
  • 937dd956d3 save default number of search items via web interface lotus 2010-09-21 19:45:49 +00:00
  • 4e8cf0c72c added a search box and navigation to api steering servlet orbiter 2010-09-21 13:10:18 +00:00
  • 09c208a3ab patch for corrupted database files (just work on and forget key) orbiter 2010-09-20 14:38:56 +00:00
  • 97ee278931 enhanced search speed: - better control of number of running search threads - no time-out waiting time when no ranking feeding takes place - local search queries by a remote peer may be faster up to 300 milliseconds - a local search may even be faster orbiter 2010-09-20 13:17:25 +00:00
  • ee3820c9cc more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error orbiter 2010-09-20 11:01:44 +00:00
  • b392ca5024 * add option to show YaCy version, usage: java -cp lib/yacycore.jar net.yacy.yacy -version f1ori 2010-09-20 10:21:05 +00:00
  • ac73072924 added a demonstration class: integrate the YaCy search results in own applications This class requests a YaCy peer remotely and produces search result objects. The class was implemented in such a way that it is as short as possible. To get a better integration of search results, use the cora package. This class is fully stand-alone, it does not need any other external library other than already contained in JRE. orbiter 2010-09-20 09:57:36 +00:00
  • 377f001e0d sorting of crawl profile names in crawl profile editor, see http://forum.yacy-websuche.de/viewtopic.php?p=20851#p20851 orbiter 2010-09-20 09:09:38 +00:00
  • 3552476fbe terminated migration from apache httpclient-3.1 to 4.1: - remove the library - added two classes from the httpclient-3.1 library as source code to YaCy because these classes were used by the YaCy HTTP Server - modified the added classes ChunkedInputStream and ContentLengthInputStream in such a way that: * there are no more dependencies to httpclient-3.1 * these classes had been simplified to serve only the purpose for the YaCy httpd orbiter 2010-09-20 08:36:48 +00:00
  • 8da4eb5de6 addition to patch in SVN 7111 orbiter 2010-09-19 23:12:50 +00:00
  • a2f9974745 some redesign in the access tracker to realize sixcooler's question about "smarter way for deleting the first Object": - not so much abstraction for a collection, makes use of remove() (no operands) possible - different way to delete elements in track (destructive, not constructive (less copies of elements in new queue)) - more abstraction for class api since no static class must be used any more orbiter 2010-09-19 23:00:24 +00:00
  • 03f0414025 some minor correction of my last commit sorry for the noise sixcooler 2010-09-19 20:57:25 +00:00
  • 42fa0eadb1 fix endless loop: Collection does not support remove(int) (isn't there a smarter way for deleting the first Object?) sixcooler 2010-09-19 20:41:44 +00:00
  • 5a9ea0308f *) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it... low012 2010-09-18 11:40:37 +00:00
  • f32bb5e51f *) Changed image in Steering.html from linked image to embedded image because shutdown is so fast now, browsers can't load image before Yacy instance is gone already. Had to make image smaller since IE does not accept large Base64 encoded images. *) Decreases wait time in Steering.html before first check since *) HTML fixes low012 2010-09-18 00:18:52 +00:00
  • 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification - added link to yacystats peer when peer is more than one day old orbiter 2010-09-17 10:20:04 +00:00
  • 29fe401f93 - some layout and text enhancement for site crawl start - Quix0rs patch from http://forum.yacy-websuche.de/viewtopic.php?p=20839#p20839 (parts) orbiter 2010-09-16 23:00:07 +00:00
  • 8c1da27347 - added more comments for user in site crawl servlet - added a disable/enable function in case that 'sitemap' is selected for functions that do (not) apply - better naming of menu items - limit default crawl depth orbiter 2010-09-16 22:04:14 +00:00
  • 3f958ccc7e fixed localization for renamed expert crawl start servlet orbiter 2010-09-16 21:29:37 +00:00