Commit Graph

  • b92c6bf897 Trying ImageIO instead of awt-Toolkit for parsing sixcooler 2011-11-14 12:37:11 +01:00
  • db5ef90b0f Merge branch 'master' of https://git.gitorious.org/yacy/rc1.git sixcooler 2011-11-14 12:22:57 +01:00
  • 69dcde5cc6 not checking for the pid-file sixcooler 2011-11-14 12:21:36 +01:00
  • 9f8240b350 script for clean copy of URL-tables sixcooler 2011-11-14 12:20:59 +01:00
  • 5c58eda45a custom start-script sixcooler 2011-11-14 12:20:33 +01:00
  • f40fef8243 custom logging settings sixcooler 2011-11-14 12:19:58 +01:00
  • 7cf8fac83f some filtering sixcooler 2011-11-14 12:19:27 +01:00
  • 3ef9f301ba some customization of Memory-Performance-Graph sixcooler 2011-11-14 12:16:07 +01:00
  • 8f25070460 weekly rewrite of blobs sixcooler 2011-11-14 12:14:07 +01:00
  • d6c1ab4e0f some more unreserved characters sixcooler 2011-11-14 12:11:22 +01:00
  • f522f61af0 clean offline copy of URL Tables sixcooler 2011-11-14 12:09:34 +01:00
  • ee2f8673a2 memory in Performance Graph - just like it was in the past sixcooler 2011-11-14 12:08:01 +01:00
  • 2a6712e4be sixcooler.de in seed-list bootstrap locations sixcooler 2011-11-14 12:06:05 +01:00
  • 54193457bc custom keep alive strategy sixcooler 2011-11-14 11:54:48 +01:00
  • 249a78ff2a G1 Memory Strategy - not used now sixcooler 2011-11-14 11:54:03 +01:00
  • ccf1583188 custom keep alive strategy sixcooler 2011-11-14 11:52:29 +01:00
  • f280e339a8 no force on Memory Request for these parser sixcooler 2011-11-14 11:46:30 +01:00
  • 5f7dbe1c42 - some refactoring (ymarks) - improvement for autotagger (is now able to create/detect multi word tags e.g. 'open source') apfelmaennchen 2011-11-13 23:19:47 +00:00
  • 6567244f2a git testing: sixcooler 2011-11-12 12:26:42 +01:00
  • 2f03186252 - small bug fix apfelmaennchen 2011-11-12 09:25:08 +00:00
  • 1e55e50c49 - removed unused code from search widget - added more comments for documentation - ALT key now submits global search - various smaller bug fixes apfelmaennchen 2011-11-11 23:18:02 +00:00
  • f0820a9d02 - more improvements for search widget (portalsearch) - added proper error handling - greatly increased robustness - greatly increased usability of navigators - some smaller speed improvements apfelmaennchen 2011-11-10 23:18:58 +00:00
  • 78ce3b13be typo orbiter 2011-11-10 11:57:26 +00:00
  • 9067ab20b2 - included missing image for portalsearch.tar.gz in build.xml - compressed (minify) yacy-portalsearch.js for better performance - removed language selector, as it doesn't work really well (at least for me) apfelmaennchen 2011-11-10 09:13:58 +00:00
  • c7d117505c - portalsearch.js some fixes for paths, when remote loading method is used apfelmaennchen 2011-11-10 08:54:55 +00:00
  • a90a72a76b - some smaller changes to search widget apfelmaennchen 2011-11-09 23:37:35 +00:00
  • a425fbd8d6 - created new target 'portalsearch' in build.xml to generate yacy-portalsearch.tar.gz for static hosting - some refactoring for search widget and jquery - update for ConfigLiveSearch.html to reflect latest changes apfelmaennchen 2011-11-09 21:01:38 +00:00
  • 42425c8003 fixed directDocByURL (has now effect if switched off) orbiter 2011-11-09 15:54:01 +00:00
  • 85d6bf4ac4 fixed urls to media content during indexing orbiter 2011-11-09 15:40:14 +00:00
  • 0d858d48ec replaced String with StringBuilder in suggestion process orbiter 2011-11-09 14:42:55 +00:00
  • d871812621 fix for http://bugs.yacy.net/view.php?id=68 as well as for a far more serious bug in navigator handling in the portal search widget. Navigators are now quite usable, but the GUI has still some flaws... apfelmaennchen 2011-11-08 23:01:22 +00:00
  • 3a807e10cf - added a cache for active crawl profiles to the crawl switchboard - moved the domain cache for domain counter from the crawl switchboard to the crawl profiles. the crawl domain counter is now therefore relative for each crawl start, not for the whole crawler. orbiter 2011-11-08 15:38:08 +00:00
  • 37e35f2741 normalization of url using urlencoding/decoding orbiter 2011-11-08 12:02:22 +00:00
  • e58438c01c - added a new retry connector for solr (for cases where solr responses are slow) - added a new exist property into the metadataRepository which includes solr entries orbiter 2011-11-08 11:49:04 +00:00
  • 62e674af50 fix for http://bugs.yacy.net/view.php?id=69 apfelmaennchen 2011-11-08 11:13:29 +00:00
  • 4d7ae76017 - update to jquery 1.7 (does not apply to all jquery code, old version is additionally kept for compatibility) - update to jquery-ui 1.8.16 (includes themes) - introduced new portalsearch (as default) - old portalsearch is still available and accessible, but will eventually be removed - jquery and portal search is now loaded by special header templates for maintenance reasons - update to new autocomplete, solves bug: http://bugs.yacy.net/view.php?id=29 - many improvements to YMarks GUI and API...more to come anytime soon apfelmaennchen 2011-11-07 20:44:58 +00:00
  • 887f088dad The IP address of the YaCy-Demo portal added to Whitelist. This is only a temporary workaround. suessthomas 2011-11-03 23:44:49 +00:00
  • d8d9735b4f stability bugfix orbiter 2011-11-03 14:41:38 +00:00
  • c31564ef08 stability bugfixes orbiter 2011-11-03 14:34:58 +00:00
  • f121f4bb45 fix for link in Supporter and Surftipps page orbiter 2011-11-01 22:49:14 +00:00
  • 94eab08794 - updated opensearchdescription text and icon - removed automatic setting of maxitems during search (can be set now elsewhere) - updated RSSMessage.java orbiter 2011-10-30 01:09:38 +00:00
  • ba41a869a7 set default number of search results in ConfigPortal.html orbiter 2011-10-29 09:22:03 +00:00
  • 279482a76d fix for npe orbiter 2011-10-29 08:45:43 +00:00
  • d260b25457 fix for npe orbiter 2011-10-29 07:28:24 +00:00
  • 2adc30d335 suppressing size if size unknown orbiter 2011-10-27 23:21:39 +00:00
  • 1b86d06d1e fix for http://bugs.yacy.net/view.php?id=62 orbiter 2011-10-26 10:07:16 +00:00
  • e09e27b1ac Win installer: remove Berlios redirect to updated JRE, link is now hardcoded again, JRE update lotus 2011-10-23 19:53:51 +00:00
  • 1b8b989744 *) set maxlength of input field for country code filter to value > default text length (old value caused warning in Opera) low012 2011-10-22 16:37:56 +00:00
  • 9e4875230f performance hacks orbiter 2011-10-20 23:06:49 +00:00
  • eb9c9edb01 enhanced table method (used by almost all yacy api interfaces) orbiter 2011-10-18 23:38:19 +00:00
  • 4ad9fc2bff new snippet strategy for search hits in metadata: show beginning of text instead of hit position orbiter 2011-10-13 00:34:52 +00:00
  • b5b09b329c BOOSTED the image search function. The result page now shows the images as embedded image links from the original source and not from the built-in image buffering and re-sizing servlet. The result is shown much faster now, not because YaCy no longer needs to re-size the images but for a very strange other reason: according to the RFC specification (http://tools.ietf.org/html/rfc2616#section-8.1.4) a browser should not open more than two connections to the same server at the same time. If the YaCy image servlet is used, then the target host is the YaCy host for all images, and that prevents the images from being loaded in parallel. orbiter 2011-10-12 22:59:58 +00:00
  • a9838f8b99 fix for http://bugs.yacy.net/view.php?id=59 orbiter 2011-10-12 22:26:48 +00:00
  • d3df03838a make sure myself-target is always inserted at its appropriate position; this was previously omitted if the own peer should have been the first target or the peer was the last peer before the rotation to AAAAAAAAAAAA hermens 2011-10-10 15:23:37 +00:00
  • c3e7efa846 added sender side prevention of rwi flooding as mentioned in SVN 7993 - saves memory and speeds up enqueueContainers by limiting the size of transfer.Chunk - saves network bandwidth by not transmitting RWIs that would get discarded at the target anyway hermens 2011-10-10 14:35:03 +00:00
  • 5af9598bd1 enhanced exported row parsing during row import; this affects the search and dht receive speed orbiter 2011-10-10 09:46:38 +00:00
  • 204e98db3a added a protection against rwi flooding orbiter 2011-10-10 01:10:49 +00:00
  • 7598a9e26b fix for thread dump orbiter 2011-10-07 23:23:49 +00:00
  • 3f606407bc added new scripts to bin in build orbiter 2011-10-07 22:57:20 +00:00
  • 8eef8722d1 update to ThreadDump analysis: freerunner and thread state recognition orbiter 2011-10-07 22:53:14 +00:00
  • 1df43b137d another performance hack orbiter 2011-10-06 23:35:14 +00:00
  • 7df0643f0e performance hacks orbiter 2011-10-06 23:31:04 +00:00
  • a7df70221e refactoring orbiter 2011-10-04 09:06:24 +00:00
  • 1b45e33f04 added robots tag parser to solr scheme orbiter 2011-09-30 13:39:01 +00:00
  • cf4fd525ee added directDocByURL attribute in crawl profile orbiter 2011-09-30 12:38:28 +00:00
  • c61e4cfd78 - fix for incomplete clear() in balancer - renamed Parser Errors to Rejected URLs orbiter 2011-09-30 10:27:14 +00:00
  • 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash orbiter 2011-09-30 08:26:31 +00:00
  • 035ebfbf3b - performance hacks (should affect the crawl balancer and reduce CPU load during crawl stack re-fill) - this may have also (good) performance side effects on other parts of YaCy orbiter 2011-09-30 07:57:50 +00:00
  • 9c131adeb6 show IP of crawled host and country in CrawlResults orbiter 2011-09-29 15:30:15 +00:00
  • b250e6466d implemented crawl restrictions for IP pattern and country lists orbiter 2011-09-29 15:17:39 +00:00
  • e207c41c8e * fix urlproxy for urls containing dollar signs f1ori 2011-09-29 12:53:55 +00:00
  • 3ac6fb0baf added dump check script orbiter 2011-09-28 21:18:49 +00:00
  • 57d5529a01 performance hacks orbiter 2011-09-28 21:16:40 +00:00
  • 5ad7f9612b added crawl settings for three new filters for each crawl: - must-match for IPs (IPs that are known after DNS resolving for each URL in the crawl queue) - must-not-match for IPs - must-match against a list of country codes (allows only loading from hosts that are hosted in given countries) orbiter 2011-09-27 21:58:18 +00:00
  • 47a8c69745 added a new feature to MultiProtocolURIs to get the locale for each url: This is done using a new library InetAddressLocator.jar which is NOT added by default to YaCy because it is very old and with that library we will never get a debian package. However, some people want that functionality and it can be made available if the library is taken from http://javainetlocator.sourceforge.net/ and placed into the /lib directory where it will be found using reflection. The new feature will be used to extend the crawler steering. orbiter 2011-09-27 15:26:14 +00:00
  • 2c3161b4ac refactoring: RankingProcess -> RWIProcess, ResultFetcher -> SnippetProcess orbiter 2011-09-26 21:42:28 +00:00
  • d2ea250d99 refactoring: - moved many classes from de.anomic to net.yacy - made more sub-packages for search classes orbiter 2011-09-25 16:59:06 +00:00
  • 42b5f09f68 *) this should fix a bug in snippet creation (also cleaned up a little bit) low012 2011-09-25 16:07:22 +00:00
  • 277b454a62 *) added comments *) minor refactoring low012 2011-09-25 13:16:52 +00:00
  • 6b22865dbc - removed some warnings - removed a dead update location orbiter 2011-09-24 01:58:54 +00:00
  • fabda9ad31 added script that can be used to delete a single url from the index; call: bin/deleteurl.sh <url> orbiter 2011-09-21 23:33:44 +00:00
  • 0c6d95e57b - more tolerance against failure of table opening - more connections for solrj orbiter 2011-09-21 15:08:05 +00:00
  • 30d340563e fix in result count display orbiter 2011-09-21 11:01:01 +00:00
  • 4f31869c5a enhanced search result timing orbiter 2011-09-21 10:43:08 +00:00
  • 6b02b696b0 - add number of search results to end of rss and json output to reflect latest status of retrieval - distinguish search access with different verify state in access of search cache orbiter 2011-09-20 19:41:44 +00:00
  • 87e6abd168 * fix urls containing a port number in urlproxy f1ori 2011-09-20 15:02:15 +00:00
  • 97045022fa * pass cookies to Server Side Includes * User.html a bit more usable f1ori 2011-09-20 14:54:14 +00:00
  • 6fba6e7cee fix: follow link target setting on image search lotus 2011-09-18 16:59:01 +00:00
  • ce2a76d603 performance hack for search process orbiter 2011-09-16 10:00:51 +00:00
  • a6bb0f9af4 fixed missing menu entries in access tracker orbiter 2011-09-15 23:26:09 +00:00
  • aaf7a0feaa yet another cache strategy orbiter 2011-09-15 22:40:01 +00:00
  • 8a428d3e77 ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation orbiter 2011-09-15 11:17:38 +00:00
  • 2c4a672fe2 bugfixes and performance hacks for table index orbiter 2011-09-15 11:17:02 +00:00
  • dad5b586a4 added a concurrent warm-up of Table data structures. That should speed up the start-up process but may also cause stronger CPU load at that time. orbiter 2011-09-15 10:01:21 +00:00
  • 734059d33e performance hacks orbiter 2011-09-14 23:34:05 +00:00
  • 23e81b28b2 synchronization enhancements orbiter 2011-09-14 21:19:02 +00:00
  • dd4635e323 patches orbiter 2011-09-14 20:11:27 +00:00
  • 65ab067491 migration to solrj 3.4.0 orbiter 2011-09-14 20:08:59 +00:00
  • ffd848c7a9 moved the log, memory, processes and the messages into a new computation monitor main menu item orbiter 2011-09-14 09:59:30 +00:00
  • ef72fdac79 added keyboard-based search result page navigation: - page-up or tab switches to next search result page - page-down switches to previous search result page orbiter 2011-09-14 09:15:09 +00:00
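
Illustration for commits 5ad7f9612b and b250e6466d (crawl filters on IPs and country codes). The following is only a minimal sketch of how the three filters described there could be combined, not YaCy's actual code. The class name CrawlFilterSketch, the method accept, and the convention that an empty country set means "no restriction" are assumptions made for this example; the IP and country code are assumed to have been resolved already by the caller.

```java
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

/** Hypothetical sketch of the three crawl filters; not taken from the YaCy sources. */
final class CrawlFilterSketch {
    private final Pattern ipMustMatch;          // the resolved IP has to match this pattern
    private final Pattern ipMustNotMatch;       // the resolved IP must not match this pattern
    private final Set<String> allowedCountries; // upper-case country codes; empty = no restriction (assumption)

    CrawlFilterSketch(String ipMustMatch, String ipMustNotMatch, Set<String> allowedCountries) {
        this.ipMustMatch = Pattern.compile(ipMustMatch);
        this.ipMustNotMatch = Pattern.compile(ipMustNotMatch);
        this.allowedCountries = allowedCountries;
    }

    /** Returns true if a host with the given IP and country code may be loaded. */
    boolean accept(String ip, String countryCode) {
        if (!ipMustMatch.matcher(ip).matches()) return false;   // must-match for IPs
        if (ipMustNotMatch.matcher(ip).matches()) return false; // must-not-match for IPs
        if (allowedCountries.isEmpty()) return true;            // no country restriction configured
        if (countryCode == null) return false;                  // unknown country is rejected here (assumption)
        return allowedCountries.contains(countryCode.toUpperCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        CrawlFilterSketch filter =
            new CrawlFilterSketch("192\\.168\\..*", "192\\.168\\.0\\.1", Set.of("DE", "FR"));
        System.out.println(filter.accept("192.168.1.5", "de"));
        System.out.println(filter.accept("192.168.0.1", "DE"));
    }
}
```

The checks run from cheapest to most specific, so a URL is dropped as soon as one filter rejects it. Whether an unknown country should be rejected or accepted is a policy decision that the commit messages do not state.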