Commit Graph

  • a3eebfdcba YMarks: - show active/running crawls - execute crawls (works currently only if API entry is available) - various smaller fixes apfelmaennchen 2011-11-17 23:11:27 +00:00
  • c50f8f9a06 code cleanup orbiter 2011-11-17 13:40:22 +00:00
  • b51c3fade4 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-17 03:50:36 +01:00
  • 84c3fc9d97 local/global fixes in search, better abstraction orbiter 2011-11-17 01:05:45 +00:00
  • aca0f33f08 enhancements for extended search options orbiter 2011-11-17 00:19:14 +00:00
  • 4f95f72124 YMarks: - working direct importer for YaCy Crawl Starts - working direct import for old bookmarks.db apfelmaennchen 2011-11-16 23:10:53 +00:00
  • a635e43f40 fix for global search attribute when selecting extended search options orbiter 2011-11-16 22:57:15 +00:00
  • 29c2289b5c Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-16 17:15:18 +01:00
  • 605bc4c10e Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-16 16:56:09 +01:00
  • aa322bc6d0 fix orbiter 2011-11-16 15:36:30 +00:00
  • 97d1347adb added also a default accept field to robots.txt downloads orbiter 2011-11-16 15:33:55 +00:00
  • f183d3822c added a default accept header in http requests since some http fraud detection functions check that this header field exist see also: http://bad-behavior.ioerror.us/ in source file browser.inc.php orbiter 2011-11-16 15:27:43 +00:00
  • 06352b8d6b more logging orbiter 2011-11-16 14:09:50 +00:00
  • a99934226e more logging for debugging of robots.txt orbiter 2011-11-16 13:56:31 +00:00
  • 7a5841e061 fix for robot parser orbiter 2011-11-16 13:12:46 +00:00
  • 458c20ff72 fix for robot parser orbiter 2011-11-16 13:06:46 +00:00
  • e7dedc56f2 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-16 11:13:03 +01:00
  • 787f6ef039 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-16 02:05:11 +01:00
  • 017a01714d - enhanced logging in robots.txt parser for remote debugging - robots.txt is now more robust against database operations orbiter 2011-11-16 01:03:49 +00:00
  • 7545822db5 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-16 01:59:48 +01:00
  • 5a7cec59f3 moved ynetSearch to get all files out of htroot/api/util/ orbiter 2011-11-16 00:21:56 +00:00
  • a410cfd7f3 - flexigrid images didn't load last time apfelmaennchen 2011-11-15 21:55:00 +00:00
  • a8dfe787ed - updated to jquery flexigrid 1.1 - YMarks.html automatically recognizes if a bookmark is a crawl start apfelmaennchen 2011-11-15 21:45:17 +00:00
  • 710ea9fcb9 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-15 18:32:09 +01:00
  • cef8ebc41d getpageinfo: Checks if there is a OAI repository behind the URL. This check is only performed if oai parameter is set when calling e.g. getpageinfo_p.xml?actions=oai cominch 2011-11-15 12:22:19 +00:00
  • 0aa5e134ea Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-15 02:31:27 +01:00
  • eb1c7c041d write info about robots.txt evaluation into getpageinfo_p.xml orbiter 2011-11-15 00:33:54 +00:00
  • f8b8c82421 - refactoring of getpageinfo_p.xml (moved out of util) - added more logging in getpageinfo_p.xml orbiter 2011-11-15 00:22:40 +00:00
  • abba31f02e - bugfix for correctly sorting ymarks - some tuning for the autotagger (still not perfect) - /api/ymarks/get_metadata.xml now provides info for crawlstarts - removed unused code apfelmaennchen 2011-11-14 22:00:44 +00:00
  • ff32469272 added a link to /api/util/getpageinfo_p.xml as API to crawl start info and to ViewFile.html orbiter 2011-11-14 20:19:41 +00:00
  • 3b70ff7046 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-14 19:25:30 +01:00
  • 3a15e58e28 - increased stability when opening the robots table - increased stability when deleting tables orbiter 2011-11-14 15:33:35 +00:00
  • 775b44017e refactoring orbiter 2011-11-14 15:11:57 +00:00
  • c99a4c0920 Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-14 14:07:58 +01:00
  • e914a30099 fix for npe orbiter 2011-11-14 12:32:15 +00:00
  • b92c6bf897 Trying ImageIO instead of awt-Toolkit for parsing sixcooler 2011-11-14 12:37:11 +01:00
  • db5ef90b0f Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-14 12:22:57 +01:00
  • 69dcde5cc6 not checking for the pid-file sixcooler 2011-11-14 12:21:36 +01:00
  • 9f8240b350 script for clean copy of URL-tables sixcooler 2011-11-14 12:20:59 +01:00
  • 5c58eda45a custom start-script sixcooler 2011-11-14 12:20:33 +01:00
  • f40fef8243 custom logging settings sixcooler 2011-11-14 12:19:58 +01:00
  • 7cf8fac83f some filtering sixcooler 2011-11-14 12:19:27 +01:00
  • 3ef9f301ba some customize on Memory-Performance-Graph sixcooler 2011-11-14 12:16:07 +01:00
  • 8f25070460 weekly rewrite of blobs sixcooler 2011-11-14 12:14:07 +01:00
  • d6c1ab4e0f some more unreserved characters sixcooler 2011-11-14 12:11:22 +01:00
  • f522f61af0 clean offline copy of URL Tables sixcooler 2011-11-14 12:09:34 +01:00
  • ee2f8673a2 memory in Perfomance Graph - just like it was in the past sixcooler 2011-11-14 12:08:01 +01:00
  • 2a6712e4be sixcooler.de in seed-list bootstrap locations sixcooler 2011-11-14 12:06:05 +01:00
  • 54193457bc cutom keep alive strategy sixcooler 2011-11-14 11:54:48 +01:00
  • 249a78ff2a G1 Memory Strategy - not used now sixcooler 2011-11-14 11:54:03 +01:00
  • ccf1583188 cutom keep alive strategy sixcooler 2011-11-14 11:52:29 +01:00
  • f280e339a8 no force on Memory Request for these parser sixcooler 2011-11-14 11:46:30 +01:00
  • 5f7dbe1c42 - some refactoring (ymarks) - improvement for autotagger (is now able to create/detect multi word tags e.g. 'open source') apfelmaennchen 2011-11-13 23:19:47 +00:00
  • 6567244f2a git testing: sixcooler 2011-11-12 12:26:42 +01:00
  • 2f03186252 - small bug fix apfelmaennchen 2011-11-12 09:25:08 +00:00
  • 1e55e50c49 - removed unused code from search widget - added more comments for documentation - ALT key now submits global search - various smaller bug fixes apfelmaennchen 2011-11-11 23:18:02 +00:00
  • f0820a9d02 - more improvements for search widget (portalsearch) - added proper error handling - greatly increased robustness - greatly increased usability of navigators - some smaller speed improvements apfelmaennchen 2011-11-10 23:18:58 +00:00
  • 78ce3b13be typo orbiter 2011-11-10 11:57:26 +00:00
  • 9067ab20b2 - included missing image for portalsearch.tar.gz in build.xml - compressed (minify) yacy-portalsearch.js for better performance - removed language selector, as it doesn't work really well (at least for me) apfelmaennchen 2011-11-10 09:13:58 +00:00
  • c7d117505c - portalsearch.js some fixes for paths, when remote loading method is used apfelmaennchen 2011-11-10 08:54:55 +00:00
  • a90a72a76b - some smaller changes to search widget apfelmaennchen 2011-11-09 23:37:35 +00:00
  • a425fbd8d6 - created new target 'portalsearch' in build.xml to generate yacy-portalsearch.tar.gz for static hosting - some refactoring for search widget and jquery - update for ConfigLiveSearch.html to refelct latest changes apfelmaennchen 2011-11-09 21:01:38 +00:00
  • 42425c8003 fixed directDocByURL (has now effect if switched off) orbiter 2011-11-09 15:54:01 +00:00
  • 85d6bf4ac4 fixed urls to media content during indexing orbiter 2011-11-09 15:40:14 +00:00
  • 0d858d48ec replaced String with StringBuilder in suggestion process orbiter 2011-11-09 14:42:55 +00:00
  • d871812621 fix for http://bugs.yacy.net/view.php?id=68 as well as for a far more serious bug in navigator handling in the portal search widget. Navigators are now quite usable, but the GUI has still some flaws... apfelmaennchen 2011-11-08 23:01:22 +00:00
  • 3a807e10cf - added a cache for active crawl profiles to the crawl switchboard - moved the domain cache for domain counter from the crawl switchboard to the crawl profiles. the crawl domain counter is now therefore relative for each crawl start, not for the whole crawler. orbiter 2011-11-08 15:38:08 +00:00
  • 37e35f2741 normalization of url using urlencoding/decoding orbiter 2011-11-08 12:02:22 +00:00
  • e58438c01c - added a new retry connector for solr (for cases where solr responses are slow) - added a new exist property into the metadataRepository which includes solr entries orbiter 2011-11-08 11:49:04 +00:00
  • 62e674af50 fix for http://bugs.yacy.net/view.php?id=69 apfelmaennchen 2011-11-08 11:13:29 +00:00
  • 4d7ae76017 - update to jquery 1.7 (does not apply to all jquery code, old version is additionally kept for compatibility) - update to jquery-ui 1.8.16 (includes themes) - introduced new portalsearch (as default) - old portalsearch is still available and accessible, but will eventually be removed - jquery and portal search is now loaded by special header templates for maintenance reasons - update to new autocomplete, solves bug: http://bugs.yacy.net/view.php?id=29 - many improvements to YMarks GUI and API...more to come anytime soon apfelmaennchen 2011-11-07 20:44:58 +00:00
  • 887f088dad The IP address of the YaCy-Demo portal added to Whitelist. This is only a temporary workaround. suessthomas 2011-11-03 23:44:49 +00:00
  • d8d9735b4f stability bugfix orbiter 2011-11-03 14:41:38 +00:00
  • c31564ef08 stability bugfixes orbiter 2011-11-03 14:34:58 +00:00
  • f121f4bb45 fix for link in Supporter and Suftipps page orbiter 2011-11-01 22:49:14 +00:00
  • 94eab08794 - updated opensearchdescription text and icon - removed automatic setting of maxitems during search (can be set now elsewhere) - updated RSSMessage.java orbiter 2011-10-30 01:09:38 +00:00
  • ba41a869a7 set default number of search results in ConfigPortal.html orbiter 2011-10-29 09:22:03 +00:00
  • 279482a76d fix for npe orbiter 2011-10-29 08:45:43 +00:00
  • d260b25457 fix for npe orbiter 2011-10-29 07:28:24 +00:00
  • 2adc30d335 suppressing size if size unknown orbiter 2011-10-27 23:21:39 +00:00
  • 1b86d06d1e fix for http://bugs.yacy.net/view.php?id=62 orbiter 2011-10-26 10:07:16 +00:00
  • e09e27b1ac Win installer: remove Berlios redirect to updated JRE, link is now hardcoded again, JRE update lotus 2011-10-23 19:53:51 +00:00
  • 1b8b989744 *) set maxlength of input field for country code filter to value > default text length (old value caused warning in Opera) low012 2011-10-22 16:37:56 +00:00
  • 9e4875230f performance hacks orbiter 2011-10-20 23:06:49 +00:00
  • eb9c9edb01 enhanced table method (used by almost all yacy api interfaces) orbiter 2011-10-18 23:38:19 +00:00
  • 4ad9fc2bff new snippet strategy for search hits in metadata: show beginning of text instead of hit position orbiter 2011-10-13 00:34:52 +00:00
  • b5b09b329c BOOSTED the image search function. The result page now shows the images as embedded image link from the original source and not from the built-in image buffering and re-sizing servlet. The result is shown much faster now not because YaCy does not need to re-size the images but for a very strange other reason: because of RFC specification (http://tools.ietf.org/html/rfc2616#section-8.1.4) a browser does not open more than two connections to the same server at the same time. If the YaCy image servlet is used, then the target host is the YaCy host for all images and that prevents a parallel computation of the image loading. orbiter 2011-10-12 22:59:58 +00:00
  • a9838f8b99 fix for http://bugs.yacy.net/view.php?id=59 orbiter 2011-10-12 22:26:48 +00:00
  • d3df03838a make sure myself-target is always inserted at its appropriate position this was previously omitted if the own peer should have been the first target or the peer was the last peer before the rotation to AAAAAAAAAAAA hermens 2011-10-10 15:23:37 +00:00
  • c3e7efa846 added sender side prevention of rwi flooding as mentioned in SVN 7993 saves memory and speeds up enqueueContainers by limiting the size of transfer.Chunk saves network bandwidth by not transmitting RWIs that would get discarded at the target anyway hermens 2011-10-10 14:35:03 +00:00
  • 5af9598bd1 enhanced exported row parsing during row import this affects the search and dht receive speed orbiter 2011-10-10 09:46:38 +00:00
  • 204e98db3a added a protection against rwi flooding orbiter 2011-10-10 01:10:49 +00:00
  • 7598a9e26b fix for thread dump orbiter 2011-10-07 23:23:49 +00:00
  • 3f606407bc added new scripts to bin in build orbiter 2011-10-07 22:57:20 +00:00
  • 8eef8722d1 update to ThreadDump analysis: freerunner and thread state recognition orbiter 2011-10-07 22:53:14 +00:00
  • 1df43b137d another performance hack orbiter 2011-10-06 23:35:14 +00:00
  • 7df0643f0e performance hacks orbiter 2011-10-06 23:31:04 +00:00
  • a7df70221e refactoring orbiter 2011-10-04 09:06:24 +00:00
  • 1b45e33f04 added robots tag parser to solr scheme orbiter 2011-09-30 13:39:01 +00:00
  • cf4fd525ee added directDocByURL attribute in crawl profile orbiter 2011-09-30 12:38:28 +00:00