Commit Graph

  • fc3ee9d8a1 changed paths for new libs also in Info.plist when changing paths in build.xml, please also update the paths in /addon/YaCy.app/Contents/Info.plist orbiter 2010-09-27 14:49:51 +00:00
  • f4357dff03 bump to httpclient-4.0.3 which fixes a number of bugs sixcooler 2010-09-27 13:24:40 +00:00
  • ad7efe6016 rssTerminal.html: Fixing the 'null' is null or not an object in rss2.js when viewing the YaCy default Status page http://localhost:8080/Status.html with Internet Explorer feed.xml: copy of feed.rss that helps Internet Explorer also read the Feed - workaround for the fix above Problem is described in the forums and should be fixed better ;-(http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2766&p=20702) mikeworks 2010-09-26 22:55:52 +00:00
  • 190de644dd de.lng: Added German translations for some missing table content on Network view WatchWebStructure_p.html: Added JS verification of RGB color codes (currently only RGB value is checked but this could be enhanced to also check for websafe colors) mikeworks 2010-09-26 22:28:32 +00:00
  • d5dc88a351 shop cleanup button only if servlet was called without post/put arguments. This should avoid confusion after a search for a word where it is possible to delete the word. If a delete button is shown to delete the word, then there should not be a button available to delete the whole index to avoide a wrong usage when a user searches only for a word to delete it. orbiter 2010-09-26 21:11:08 +00:00
  • afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent *) code changes (PMD) without functional changes low012 2010-09-26 12:57:07 +00:00
  • a83186ac7d fix for bug in cytrails orbiter 2010-09-26 10:32:40 +00:00
  • b019426811 de.lng: Added German translations for new Index Creation pages RSS Feeds and adapted text in Tables_p.html and CrawlStartExpert_p.html to match some typos, also changed one name tag to id to conform with XHTML 1.0 Strict mikeworks 2010-09-26 01:39:51 +00:00
  • 48c0d508ac fixes for crawling of smb links (file length not always available) orbiter 2010-09-25 22:32:26 +00:00
  • 965a40b623 de.lng: Added some German translation for Admin Console -> Advanced Settings and started to add some translations for new Index Creation pages mikeworks 2010-09-25 21:53:13 +00:00
  • 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts - added missing entry for new icu4j path in Mac App orbiter 2010-09-25 21:10:50 +00:00
  • 6295cac35a set reasonable mime-type for icu4j-core.jar f1ori 2010-09-25 20:04:34 +00:00
  • e670e1ef8e add charset auto-detection for htmlParser f1ori 2010-09-25 18:02:21 +00:00
  • ddcd5ae78c fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989 f1ori 2010-09-25 15:07:25 +00:00
  • 8fe1102452 fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426 reuse code from htmlParser f1ori 2010-09-24 13:14:20 +00:00
  • daeea96aea renamed servlet CrawlStart_p.html to CrawlStartSite_p.html to circumvent problem with translation which still showed old expert crawl start page orbiter 2010-09-22 21:46:31 +00:00
  • 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes also enhances selection criteria for best snippet line computation orbiter 2010-09-22 20:50:02 +00:00
  • 4450c240b7 npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982 lotus 2010-09-22 20:24:07 +00:00
  • 84a023cbc8 fixed several search bugs orbiter 2010-09-21 21:48:42 +00:00
  • 937dd956d3 save default number of search items via web interface lotus 2010-09-21 19:45:49 +00:00
  • 4e8cf0c72c added a search box and navigation to api steering servlet orbiter 2010-09-21 13:10:18 +00:00
  • 09c208a3ab patch for corrupted database files (just work on and forget key) orbiter 2010-09-20 14:38:56 +00:00
  • 97ee278931 enhanced search speed: - better control of number of running search threads - no time-out waiting time when no ranking feeding takes place - local search queries by a remote peer may be faster up to 300 milliseconds - a local search may even be faster orbiter 2010-09-20 13:17:25 +00:00
  • ee3820c9cc more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error orbiter 2010-09-20 11:01:44 +00:00
  • b392ca5024 * add option to show YaCy version, usage: java -cp lib/yacycore.jar net.yacy.yacy -version f1ori 2010-09-20 10:21:05 +00:00
  • ac73072924 added a demonstration class: integrate the YaCy search results in own applications This class requests a YaCy peer remotely and produces search result objects. The class was implemented in such a way that it is as short as possible. To get a better integration of search results, use the cora package. This class is fully stand-alone, it does not need any other external library other than already contained in JRE. orbiter 2010-09-20 09:57:36 +00:00
  • 377f001e0d sorting of crawl profile names in crawl profile editor, see http://forum.yacy-websuche.de/viewtopic.php?p=20851#p20851 orbiter 2010-09-20 09:09:38 +00:00
  • 3552476fbe terminated migration from apache httpclient-3.1 to 4.1: - remove the library - added two classes from the httpclient-3.1 library as source code to YaCy because these classes were used by the YaCy HTTP Server - modified the added classes ChunkedInputStream and ContentLengthInputStream in such a way that: * there are no more dependencies to httpclient-3.1 * these classes had been simplified to serve only the purpose for the YaCy httpd orbiter 2010-09-20 08:36:48 +00:00
  • 8da4eb5de6 addition to patch in SVN 7111 orbiter 2010-09-19 23:12:50 +00:00
  • a2f9974745 some redesign in the access tracker to realize sixcoolers question about "smartes way for deleting the first Object": - not so much abstraction for a collection, makes use of remove() (no operands) possible - different way to delete elements in track (destructive, not constructive (less copies of elements in new queue)) - more abstraction for class api since no static class must be used any more orbiter 2010-09-19 23:00:24 +00:00
  • 03f0414025 some minor correction of my last commit sorry for the noise sixcooler 2010-09-19 20:57:25 +00:00
  • 42fa0eadb1 fix endless loop: Collection does not support remove(int) (isn't there a smartes way for deleting the first Object?) sixcooler 2010-09-19 20:41:44 +00:00
  • 5a9ea0308f *) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it... low012 2010-09-18 11:40:37 +00:00
  • f32bb5e51f *) Changed image in Steering.html from linked image to embedded image because shutdown is so fast now, browsers can't load image before Yacy instance is gone already. Had to make image smaller since IE does not accept large Base64 encoded images. *) Decreases wait time in Steering.html before first check since *) HTML fixes low012 2010-09-18 00:18:52 +00:00
  • 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification - added link to yacystats peer when peer is more than one day old orbiter 2010-09-17 10:20:04 +00:00
  • 29fe401f93 - some layout and text enhancement for site crawl start - Quix0rs patch from http://forum.yacy-websuche.de/viewtopic.php?p=20839#p20839 (parts) orbiter 2010-09-16 23:00:07 +00:00
  • 8c1da27347 - added more comments for user in site crawl servlet - added a disable/enable function in case that 'sitemap' is selected for functions that do (not) apply - better naming of menu items - limit default crawl depth orbiter 2010-09-16 22:04:14 +00:00
  • 3f958ccc7e fixed localization for renamed expert crawl start servlet orbiter 2010-09-16 21:29:37 +00:00
  • 58b7417a59 - added a new 'easy' crawl start menu which can be used for the special case of loading a complete domain - the previous crawl start servet was renamed to CrawlStartExpert_p - easy crawl start is now default orbiter 2010-09-16 12:02:43 +00:00
  • 461a2a6ec7 enhanced remote crawling: - 300 ppm is default now (but this is switched off by default; if you switch it on you may want more traffic?) - better timing for busy queue - better amount of remote url retrieval - better time-out values - better tracking of availability of remote crawl urls - more logging for result of receipt sending orbiter 2010-09-16 09:34:17 +00:00
  • 670ba4d52b - removed the remote crawl option from the network configuration submenu and - added a remote crawl menu item to the index create menu. This menu also shows a list of peers that provide remote crawl urls - set remote crawl option by default to off. This option may be important but it also confuses first-time users orbiter 2010-09-16 00:39:05 +00:00
  • 89c2d8b81e better initial hash computation orbiter 2010-09-15 22:11:52 +00:00
  • 34e2f7f487 enhanced snippet fetch strategy: concurrent snippet fetch even for offline-snippet searches. This improves speed since it is now possible to fetch snippets offline and parsing of source files from the htcache can be enhanced using concurrency. This improves local and remote search. orbiter 2010-09-15 21:09:14 +00:00
  • 0cf006865e refactoring and enhanced concurrency orbiter 2010-09-15 11:38:03 +00:00
  • 83ac07874f - corrected return value of put() methods (not used anywhere, so it did not harm before) - added use of LookAheadIterator which should prevent mistakes when coding iterators with embedded iterators - added a fail-safe reaction in case of database corruption using iterators over database elements (no interruption then) orbiter 2010-09-15 10:43:14 +00:00
  • f9a27a05e5 migrated to log4j 1.2.16 orbiter 2010-09-15 09:18:35 +00:00
  • 5c67e6ca49 migrated to latest apache commons fileupload 1.2.2 orbiter 2010-09-15 08:54:41 +00:00
  • 5702419194 fixed a bug in HTTPClient: keep-alive must be set to false, otherwise servers hold connections 2 seconds open until response. orbiter 2010-09-14 22:25:35 +00:00
  • 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart - changed data structure for sorting in search which performs better in that specific case (too many updates) orbiter 2010-09-14 21:03:50 +00:00
  • ac1c08924e more performance hacks orbiter 2010-09-14 15:27:27 +00:00
  • 14c843d364 more performance hacks orbiter 2010-09-14 15:00:34 +00:00
  • 39f409a7bb performance hacks orbiter 2010-09-14 14:32:24 +00:00
  • 7ebef56add - redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests - added a remote search test main methods in yacyClient orbiter 2010-09-14 13:35:47 +00:00
  • 2e75879504 fix for latest commit orbiter 2010-09-14 13:01:18 +00:00
  • 6e4653cf50 remove DoS protection in remote search for intranet hosts orbiter 2010-09-14 12:38:05 +00:00
  • 3c0e07ba72 removed all delays in shutdown process orbiter 2010-09-14 09:13:28 +00:00
  • 906c572621 - enhanced index create menu structure - clear search log caches each time a search is done orbiter 2010-09-14 09:06:27 +00:00
  • fc924f024e import of oai sources from a list using a command line interface: if you have a list of oai servers you can import them all using the linux command: bin/importOAIList.sh <name-of-oai-list-file> orbiter 2010-09-13 10:13:34 +00:00
  • 64860dc1bb enhanced search event logging (to be used for further improvements) orbiter 2010-09-13 09:33:04 +00:00
  • 7dbc357593 patch to identify corrupted database files orbiter 2010-09-13 07:20:53 +00:00
  • 17eebd4ef8 counting crawler traffic again: fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2808 sixcooler 2010-09-11 15:58:15 +00:00
  • 547d5226ae fix banner reload parameters (were no html errors) adapted default colours lotus 2010-09-11 11:05:07 +00:00
  • d2a3d08c44 avoid div. by zero lotus 2010-09-11 10:58:33 +00:00
  • 2c7edea35e - better shutdown behavior for the GUI (waits until data is written if GUI is killed) - release 0.97 orbiter 2010-09-10 12:47:24 +00:00
  • 34a25856a5 - added navigation to next/prev search page using arrow keys (left/right) - better information text for YaCy GUI application orbiter 2010-09-10 10:42:01 +00:00
  • 32f73d1aaa added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters) orbiter 2010-09-10 09:48:09 +00:00
  • 5ce679a053 focus search field on load, no click necessary anymore lotus 2010-09-10 08:45:26 +00:00
  • 013926f01c added 'francais' as language option for default configuration orbiter 2010-09-10 08:35:47 +00:00
  • 4c21d8dc9d - changed default values for online caution (the pausing may not be necessary any more) - fixed bug in WeakPriorityBlockingQueue - show favicon faster using pre-loading (same technique as used for fast image search) orbiter 2010-09-09 23:25:19 +00:00
  • 570ca577c6 performance hacks orbiter 2010-09-09 22:42:54 +00:00
  • 348dece62f redesign of the SortStack and SortStore classes: created a WeakPriorityBlockingQueue as special implementation of a PriorityBlockingQueue with a weak object binding. - better abstraction of ordering technique - fixed some bugs according to result numbering (distinguish different counters in Queue) - fixed a ordering bug in post-ranking (ordering was decreased instead of increased) - reversed ordering numbering using a reversed ordering. The higher the ranking number the better (now). orbiter 2010-09-09 15:30:25 +00:00
  • 03eb021568 Fix for byte[] Objects as keys hermens 2010-09-09 14:41:20 +00:00
  • 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements - removed old parser - removed old importer framework (was only used by removed old parser) - added a new sitemap parser in parser framework - linked new parser with parser access in old sitemap processing routines orbiter 2010-09-08 14:13:15 +00:00
  • b73ea6581d fix json in case of query includes " lotus 2010-09-08 11:54:25 +00:00
  • c0b08ac59b slighlty changed way of pdf parser integration orbiter 2010-09-08 07:32:47 +00:00
  • 6d83c7cb62 removed unnecessary Override statements (produces errors in strict validation) orbiter 2010-09-08 07:15:41 +00:00
  • 6a09f1f7e5 fix dedicated upnp testing lotus 2010-09-07 18:17:23 +00:00
  • 848ef6937e licenses for pdfbox orbiter 2010-09-07 17:17:08 +00:00
  • 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1 - added some clear statements that shall clear static cache size within the pdfbox library - the pdfbox library contains a memory leak; it is unsafe to run a peer with pdf parser permanently on. orbiter 2010-09-07 17:13:47 +00:00
  • c757a4aa9f - corrected lifetime computation for search events - made search event cache cleanup concurrent because cleanup may cause index modifications orbiter 2010-09-06 16:05:19 +00:00
  • ec8f1c0446 adapted _debug.bat to regular starter script lotus 2010-09-06 13:36:57 +00:00
  • 5dff8f62c4 fix tray information display for non-windows lotus 2010-09-06 13:30:40 +00:00
  • 24502fe3de performance hacks orbiter 2010-09-06 12:59:33 +00:00
  • a6c2e9ef0c add file.encoding=UTF8 to debian init script f1ori 2010-09-06 12:53:03 +00:00
  • ffaa9a1c51 avoiding double-loading of the same resource from the web in case that a seond attempt to load the resource is started while the first attempt is still loading the content from the web. This will delay the second attempt to the time when the first attempt has finished with the possible result that the second attempt reads only from the web cache, not from the web. This will also enhance the process of image result display from SVN 7105 orbiter 2010-09-06 10:23:30 +00:00
  • fb828f3767 - performance enhancements in search response time using faster query ID computation and an ID cache - code cleanup orbiter 2010-09-06 10:00:07 +00:00
  • 0ab6a462ee - added a missing entry in YaCy interface robots.txt for bookmarks - changed default robots.txt deny list to include some more interface pages because the loading of such pages are a peer load issue for YaCy when crawlers come by and information on these pages are not useful for public search. orbiter 2010-09-06 09:58:54 +00:00
  • d865ef77a8 removed re-read of index in case of a bad index. This may not solve the problem but it applies a 100% CPU problem on the peer. I'm afraid bad index files must be abandoned, and cannot be fixed this way. orbiter 2010-09-06 09:55:04 +00:00
  • b2c9db48ea Performance enhancement - introduced byte[] - based ARC method for MapHeap which avoids a String generation each time the cache is accessed - bugfixing in required class ComparableARC orbiter 2010-09-06 09:53:33 +00:00
  • ae07e11bc5 enhanced image search result display: concurrent loading of images before they are displayed orbiter 2010-09-05 23:02:46 +00:00
  • 72a096fccb using -XX:-UseGCOverheadLimit when starting java. see also: http://forum.yacy-websuche.de/viewtopic.php?p=20709#p20709 orbiter 2010-09-05 14:19:02 +00:00
  • 22047ffad5 enhanced computation speed of many replaceAll string operations orbiter 2010-09-05 13:19:42 +00:00
  • e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation orbiter 2010-09-05 11:13:12 +00:00
  • 9c0c94683c because of a bug in search result caching count search results had not been generated as fast as possible. with this fix search results are (even) faster. Also enhanced: image search. This is now speeded up using a image search result look-ahead orbiter 2010-09-04 22:57:12 +00:00
  • fa2eb9676e removed unused class orbiter 2010-09-04 21:45:33 +00:00
  • 5f391fcfa9 *) cleaned up in wikiCode parser (more to be done) *) HTML fixes low012 2010-09-04 14:01:34 +00:00
  • b3f0d06444 fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by: - storing the startup properties when yacy is started - using the properties in the restart-script again. this transports also the DATA directory location as parameter of the -gui option that is used when the Mac version of YaCy is started orbiter 2010-09-03 23:08:43 +00:00
  • d4e4967e19 cleaned up code in yacyRelease (there will be work to do there) orbiter 2010-09-03 22:35:48 +00:00
  • 7be988768d simple selection of views in ViewFile.html (omit usage of button) orbiter 2010-09-03 22:35:07 +00:00
  • 2cb8cf5b41 added overall utf-8 default support for mac releases orbiter 2010-09-03 22:34:08 +00:00