Commit Graph

  • 9977fb9cf5 more enhancements to Network servlet: own peer in overview orbiter 2010-05-21 23:50:39 +00:00
  • bfdb9f4e06 extended statistics on Network servlet page - added number of online peers at the last day and the last week - changed design of statistic table - network picture now shows exactly those peers that are counted in the statistic overview for one day orbiter 2010-05-21 23:27:32 +00:00
  • 431852f0a7 testing new 'search on map' image (slightly larger) orbiter 2010-05-21 13:12:47 +00:00
  • e40542579e fixes for wrong attribute name search->query (SRU) orbiter 2010-05-21 13:02:35 +00:00
  • 903ff21478 increased default time-out orbiter 2010-05-21 09:09:26 +00:00
  • 98c1d65415 - show up to 10 locations (maps) after search (instead of a max of 5) - order locations by (primary) population and (secondary) longitude (reverse ordering, both) - added population from GeoNames, OpenGeoDB does not have that information - changed default viewpoint of map to (30,15); shows more land and europe in the center orbiter 2010-05-21 08:18:04 +00:00
  • 9842fab6e4 - fixes to query parameter - replaced/removed search query attribute (was old style, new is 'query' according to SRU) orbiter 2010-05-20 22:05:04 +00:00
  • 6ec9ced4cd - fix for multi-word search for locations - changed description text to 'title' entity (subject is a list of keywords and was very messy) - added ViewFile in location pop-up orbiter 2010-05-20 15:07:57 +00:00
  • 7f35e1955e Added alt tag and width and height properties to earthsearch.png in yacysearchtrailer.html for HTML validity Added alt tag to page tabs in yacysearch.java for HTML validity Added new German translations for geo search phrase in de.lng mikeworks 2010-05-20 06:36:02 +00:00
  • 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between web search results and locations found in the metadata fields - used that to display two layers on map: cities and search result locations - added many marker graphics for the display of the markers on the map - some refactoring of the yacy news code plus bugfixes for latest move from Tree to Table data structure orbiter 2010-05-19 12:53:09 +00:00
  • ad823a4716 *) minor changes (only cosmetics, no functional changes) low012 2010-05-18 21:31:59 +00:00
  • dcac90d2f9 *) removed unnecessary import low012 2010-05-18 21:09:41 +00:00
  • 1e8c6cefae - added 'search on map' - Link to search result page - added default search option to location search - show default search in search window on location search page - added icon for location search orbiter 2010-05-18 14:48:54 +00:00
  • 227ebc6651 - added more map layers to the new location search: openstreetmap (mapnik, osmarender, cycle map) - cycle map is default because it looks best at 'world view' - added control elements to map - increased map size - added deletion of search results for each time when a new search is done - moved search box up and added yacy icon in such a way that the search page looks exactly the same as the standard search orbiter 2010-05-18 13:52:15 +00:00
  • bd0a9df895 fix for bad location double check orbiter 2010-05-18 11:54:30 +00:00
  • b7556893c6 removed terminate buttons for built-in crawl profiles in crawl profile editor orbiter 2010-05-18 07:08:01 +00:00
  • a5ec7db4ab *) Oops! low012 2010-05-17 19:44:02 +00:00
  • b02078b58c *) added visualization of GeoRSS search (very basic, but it's a start...) *) removed double code low012 2010-05-17 19:39:41 +00:00
  • bd2587954a ? orbiter 2010-05-16 01:05:45 +00:00
  • e43e61e502 added another geolocalization data source: GeoNames - added downloader option in DictionaryLoader - added generalization (interfaces and overarching localization) - more abstraction using the libraries orbiter 2010-05-15 23:49:30 +00:00
  • c9862e0ca9 *) removed unnecessary imports low012 2010-05-15 01:17:45 +00:00
  • 76aea981ec *) added W3C geo GeoRSS (see http://en.wikipedia.org/wiki/Georss) low012 2010-05-15 01:11:03 +00:00
  • 118d589eff replaced the very very old data structure 'Records' with a simple table to fix the problem from http://forum.yacy-websuche.de/viewtopic.php?p=20066#p20066 orbiter 2010-05-15 00:59:02 +00:00
  • 734298facd *) added missing namespace declarations in yacysearch.rss *) minor change in yacysearchtrailer.java low012 2010-05-15 00:09:53 +00:00
  • 2a8f70f0ca - fix for caching of OSM tiles. if you want that this fix applies to your peer, please delete the crawl profiles - fix for initial generation of crawl profiles (one more reason to remove your crawl profiles) - more String -> byte[] migration - more logging for cache store/hit orbiter 2010-05-14 23:50:07 +00:00
  • 2126c03a62 - removed download-limit that can be given for the crawler for non-crawler download tasks. This was necessary because the same procedure was used for other downloads like for the download of dictionary files where a limit is not useful. The limit still stays for the indexer - migrated the opengeodb downloader to a new version of the opengeodb-dump orbiter 2010-05-14 18:30:11 +00:00
  • 3661cb692c added dictionary loader servlet that can be used to get the geolocalization file: /DictionaryLoader_p.html Will also be used for more dictionary files in the future orbiter 2010-05-14 09:52:53 +00:00
  • 90fa8fd4d4 - support gpx file extension - non-blocking location search (time-out handling was wrong) orbiter 2010-05-12 08:49:20 +00:00
  • b0927d26e0 *) fix for "more options" link *) removed surplus code low012 2010-05-12 00:48:24 +00:00
  • 439b44be9e removed exit from computation in ReferenceContainerArray.get merge method; a warning is still given, but method computes at normal operation. see also: http://forum.yacy-websuche.de/viewtopic.php?p=20038#p20038 orbiter 2010-05-11 23:36:40 +00:00
  • 7b880d73d0 adjustments to granted query size orbiter 2010-05-11 23:28:43 +00:00
  • 4cd56d3966 - fix for http://forum.yacy-websuche.de/viewtopic.php?p=20036#p20036 - enhancement to kml search orbiter 2010-05-11 23:06:39 +00:00
  • 586bc4d920 - remove superfluous entries in remote search tracker handles - avoid concurrent access from same client this is a fix for http://forum.yacy-websuche.de/viewtopic.php?p=20045#p20045 orbiter 2010-05-11 22:26:18 +00:00
  • 789c6b26ce added a location search service: using the following servlet/example: http://localhost:8080/yacysearch_location.kml?query=berlin&maximumTime=2000&maximumRecords=100 orbiter 2010-05-11 12:58:05 +00:00
  • f23cbd2dab more bugfixes to date parser orbiter 2010-05-11 11:32:46 +00:00
  • cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data - fixes to http file server session handling - fixes and enhancements to metadata date/time handling - added dc:publisher metadata field and updated all document parser - fixed bug in metadata read procedure - enhanced dublin core and rss parser to understand more fields more properly - enhanced url selection in case that multiple urls are given in surrogates - fix for condenser; failure when last word does not end with termination symbol orbiter 2010-05-11 11:14:05 +00:00
  • 6eba2cb96b fix in bmp parser orbiter 2010-05-09 13:27:58 +00:00
  • c6d9a12a99 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2810 lotus 2010-05-09 11:21:11 +00:00
  • c45117f81f fixed dates in metadata orbiter 2010-05-08 22:09:36 +00:00
  • 0a5fd15703 :-( orbiter 2010-05-06 22:06:31 +00:00
  • ac16f582aa fix for http://forum.yacy-websuche.de/viewtopic.php?p=20017#p20017 orbiter 2010-05-06 22:04:30 +00:00
  • a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml orbiter 2010-05-06 10:01:37 +00:00
  • cf13c65bdd added another network definition file for the open access (decentral OAI) search network orbiter 2010-05-05 22:47:03 +00:00
  • 06ff0c5b06 fixes for metadata retrieval and presentation orbiter 2010-05-05 22:45:54 +00:00
  • 5c5e6accdb Fixes for (X)HTML compatibility. suessthomas 2010-05-05 21:12:58 +00:00
  • 5d87379be6 fix for yacy search urls in compare search orbiter 2010-05-05 15:44:40 +00:00
  • 31f30f0925 added compare search for OAIster and oai.yacy.net orbiter 2010-05-05 15:32:36 +00:00
  • 7ab207d93a better presentation of search result metadata and fixes to htcache loading orbiter 2010-05-04 20:57:09 +00:00
  • 5fbf866cae - fixed resumption token generation for oai-pmh import - relaxed dublin core parsing: the dc:reference tag may replace dc:identifier if this does not contain a valid url - parsing of completeRecords number and presentation in the download list of oai import orbiter 2010-05-02 22:20:24 +00:00
  • af13a02307 YaCySearchhtmlFF.html - fixes for XHTML compatibility. suessthomas 2010-05-02 21:03:40 +00:00
  • 5efc0dce0b fix for domain options in search box orbiter 2010-04-30 21:53:20 +00:00
  • fc5efcc05a enhanced and fixed OAI-PMH import - now importing OAI-PMH server list from two sources - simultaneous import from several servers (even > 2000) - check buttons on OAI-PMH server list to select multiple servers for import start - it is possible to select all servers at once for import - imported XML data is gzipped after import from surrogate reader orbiter 2010-04-30 14:03:51 +00:00
  • c2098f9399 close unused connections if there are too many for DHT sixcooler 2010-04-29 23:38:50 +00:00
  • 455a763d7c performance hacks orbiter 2010-04-28 08:38:57 +00:00
  • 40a8d132d9 tried to fix 100% CPU when calling Balancer.top() see also: http://forum.yacy-websuche.de/viewtopic.php?p=19978#p19978 orbiter 2010-04-27 22:37:50 +00:00
  • b6cce08019 fixed a bug in rwi storage data size allocation orbiter 2010-04-27 22:22:16 +00:00
  • 90c3e5d6f6 - cleanup, removed unused imports - added crawling queue sizes to /api/status_p.xml, syntax same as in queues_p.html - fixed a bug in queue enumeration that caused an out of bounds exception orbiter 2010-04-27 21:47:41 +00:00
  • 3aad50d38e :-( orbiter 2010-04-26 15:26:08 +00:00
  • 9edd38fbc5 connectionCount limit too low? orbiter 2010-04-26 15:24:47 +00:00
  • 7a05db0fcb fixed to prevent that too many open connections exist - create less connections at maximum (smaller httpc connection pool size) - create less connections per host (2, standard required by RFC) - do not start DHT distributions if there are too many open connections - clear open/idle connections earlier; run cleaner more often orbiter 2010-04-25 23:08:36 +00:00
  • a9b9bf667b fix for http://forum.yacy-websuche.de/viewtopic.php?p=19910#p19910 orbiter 2010-04-25 21:48:30 +00:00
  • b18a7606a0 some performance hacks and fixes after reading dump in http://forum.yacy-websuche.de/viewtopic.php?p=19920#p19920 orbiter 2010-04-25 21:37:36 +00:00
  • 2bc3cba6f1 - fix for 'do not write to cache' rule. - do not read from cache if byte[] array is still filled from response object (will do less IO) orbiter 2010-04-24 08:22:45 +00:00
  • 4cd5418963 removed finalize methods because of a hint in http://java.sun.com/javase/6/webnotes/trouble/TSG-VM/html/memleaks.html#gbyvh orbiter 2010-04-23 09:32:29 +00:00
  • bfa35d6d20 possible fix for ZURL.list counter orbiter 2010-04-23 08:46:47 +00:00
  • b7efbe55cf i guess these documents are here by mistake orbiter 2010-04-23 07:03:31 +00:00
  • cff8ed134f added index check to prevent blocking in synchronization orbiter 2010-04-22 22:16:38 +00:00
  • 65f383e70b some adjustments to the httpc after testing with a very slow httpd orbiter 2010-04-22 22:10:19 +00:00
  • 5ab5ac80fe fix for NPE in TextParser orbiter 2010-04-21 22:35:47 +00:00
  • b95ae2518b fix for assert orbiter 2010-04-21 17:59:22 +00:00
  • 3247f0e901 fix for deadlocks caused by self-blocking access to TreeMap in concurrent environments. The TreeMap was replaced by a ConcurrentHashMap and additional care that the strings are compared all in lowercase orbiter 2010-04-21 13:46:02 +00:00
  • 027b971bde fix for concurrent quicksort: catch jobs from ThreadPoolExecutor that had been rejected because of full processing queues. Uncaught jobs may have been the cause for blockings and freezes in case of overloading during strong processing orbiter 2010-04-21 13:44:59 +00:00
  • 8c40f1cb8e self-healing for broken table files (may cause other problems, but better than nothing) orbiter 2010-04-21 11:29:27 +00:00
  • 13f5b8e7ba fix for storing/getting bookmark-folders called by Quix0r sixcooler 2010-04-20 17:55:59 +00:00
  • 7b69d79727 enhanced remove() operation: in many cases it is not necessary to return the removed object to the caller. for such cases the delete() operation was introduced which is sometimes much cheaper in operation since it does not need to create objects to hold the removed content and it does not need to read those objects. orbiter 2010-04-20 14:47:41 +00:00
  • 93ea0a4789 enhanced remove operation in search consequences (which are triggered when the snippet fetch proves that the word has disappeared from the page that was stored in the index) - no direct deletion of references during search (shifted to time after search) - bundling of all deletions for the references of a single word into one remove operation - enhanced remove operation by caring that the collection is stored sorted (experimental) - more String -> byte[] transition for search word lists - clean up of unused code - enhanced memory allocation of RowSet Objects (will use a little bit less memory which was wasted before) orbiter 2010-04-20 13:45:22 +00:00
  • 7a59012632 fix for NPE orbiter 2010-04-20 07:43:48 +00:00
  • 1a6c2f77b4 fix for NPE in statistic servlet orbiter 2010-04-20 00:08:43 +00:00
  • 64f29f990e a collection of performance hacks and code cleanup: - removed usage of URL-Caches which could have been a memory leak - removed unused classes and methods - removed not necessary synchronizations - added synchronization hacks where possible - fine-tuned crawling speed to prevent IO of balancer - fixed a bug in IODispatcher that may have caused that no merges were done - reduced number of parameters in very often called methods (compare methods) - reduced complexity of data structures of now massively used HandleSet class - reduction of new String() and getBytes() usage / new methods to support this transition orbiter 2010-04-19 16:42:37 +00:00
  • 8b8107b2a3 reduced IO-load and synchronization/blocking - enhanced the Balancer performance when building new domain stacks using a new Table buffer - added the new Table buffer BufferedObjectIndex class - changed order of access to LURL-read (preferring segment over Crawl Queues) will reduce blocking time on balancer - fixed PPM setting in Crawler_p servlet (had doubled values) - reduced synchronization in IndexCell because it is not necessary: reduced blocking during indexing/merging/dumping - removed did-you-mean cache in IndexCell because that caused too much overhead and more memory usage but was not very useful. This also reduced deadlocks that could be caused when searches are performed during indexing. orbiter 2010-04-18 21:55:20 +00:00
  • 7417425e6a Added (b)log notice in header of table in Network.html Added some missing translations for German de.lng in Network.html page mikeworks 2010-04-17 08:16:42 +00:00
  • ed07046870 flush only when > 3000 RWIs present + code cleanup orbiter 2010-04-16 16:07:19 +00:00
  • 3a50b5aa04 enhanced object hash computation orbiter 2010-04-15 14:19:29 +00:00
  • 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790. The result should be less usage of new String() and less memory usage (since a String-encapsulated byte[] has 40 bytes overhead) orbiter 2010-04-15 13:22:59 +00:00
  • dde394a977 - shifted some computation out of synchronization to allow more concurrency - removed synchronization where not necessary orbiter 2010-04-14 23:22:06 +00:00
  • f204076d25 removed usage of temporary files: causes too much IO orbiter 2010-04-14 22:17:18 +00:00
  • 48b9371735 changed balancer re-load counter. causes less blocking here doing intranet indexing. orbiter 2010-04-14 20:57:26 +00:00
  • 650be3599f added a time-out to the RWI cache to flush the cache if it has not been written for ten minutes. This additional dump criterion is necessary because some data sources repeat their vocabulary and may cause that the number of words in a RWI does not increase while the number of references in the RWI set increases. Now the RWI Buffer is flushed every 10 minutes or later if at that time already a dump is ongoing. orbiter 2010-04-14 20:30:34 +00:00
  • ff6cf24b80 replaced RowSetArray in ObjectIndexCache with RowSet to reduce complexity in MergeIterator. This complexity caused too much computing overhead when the RowSetArray had become very large. orbiter 2010-04-14 19:26:51 +00:00
  • 0d04ab1422 new access tracking data type strategy; previous data types may have caused deadlocks of httpd when performing statistic cleanups orbiter 2010-04-14 16:18:04 +00:00
  • fc43f3028e *) hopefully fixing NPE issue introduced in r6797 low012 2010-04-13 23:33:50 +00:00
  • 55d8e686ea performance hacks orbiter 2010-04-13 23:29:55 +00:00
  • 2f181d0027 introduced concurrency in HTCACHE storage compression orbiter 2010-04-13 16:22:09 +00:00
  • 2e26744f4e more concurrency when normalizing RWI entries + cleanup orbiter 2010-04-13 14:47:57 +00:00
  • 555b333041 fix for wrong count of server processes. may fix non-access problems in some cases orbiter 2010-04-13 14:34:16 +00:00
  • 5d8238680f better thread dump statistic evaluation in threaddump servlet when creating multiple dumps for profiling: no distinguishing between thread states orbiter 2010-04-13 12:19:09 +00:00
  • aa083fc45c try to get a fix for OOM problem in case that there is no real problem with missing memory. See also http://forum.yacy-websuche.de/viewtopic.php?p=19835#p19835 orbiter 2010-04-13 11:39:54 +00:00
  • 70e6222978 more concurrency during search requests orbiter 2010-04-13 11:12:36 +00:00
  • 4917f96729 fixes for some changes in SVN 6797 that caused NPEs when the bookmarks were initialized orbiter 2010-04-13 10:14:08 +00:00
  • dff660441a *) changes for better code readability low012 2010-04-13 01:31:16 +00:00