Commit Graph

  • 9ef5a80f4e add interaction for triples and selector for augmented browsing cominch 2012-06-10 10:38:54 +02:00
  • 282c1620d6 Allow TripleStore to be persistent after reboot cominch 2012-02-13 18:04:22 +01:00
  • 5d20cd324a Add Triplestore and RDF query interface cominch 2012-06-10 10:35:59 +02:00
  • bc9a618e0a augmented browsing: ignore js and css, integrate more user interaction cominch 2012-06-10 10:29:15 +02:00
  • b21048892b augmentedParser add features and integrate external html parser to modify existing web pages cominch 2012-06-10 10:23:35 +02:00
  • 9cbfc1a1c0 augmentedProxy, which forwards every proxy request to a rewrite engine to customize existing webpages. originally implemented by Florian Richter. cominch 2012-06-10 10:15:34 +02:00
  • 1626be7916 Add menu entries for urlproxy / augmented browsing cominch 2012-01-19 14:24:17 +01:00
  • a32943b382 add json mimetype cominch 2012-01-18 10:14:33 +01:00
  • 3b992e6b00 using utf8 String compression in Webstructure database Michael Peter Christen 2012-06-09 11:00:33 +02:00
  • 26301a538d bugfix in Domains - dns-lookup Michael Peter Christen 2012-06-09 10:59:45 +02:00
  • cde20911bb saved a bit more ram using UTF8 String compression for OpenGeoDB and Geonames data files. Michael Peter Christen 2012-06-09 10:07:11 +02:00
  • 225ee42879 made the GeoLocation into an interface with the current integer implementation as accuracy implementation of 1.863cm Michael Peter Christen 2012-06-09 09:46:27 +02:00
  • 5b25272f40 added location search to main menu Michael Peter Christen 2012-06-09 09:10:54 +02:00
  • 2280a7b276 - changed initialization order to prefer allocation of memory for table files first - bugfixes in memory amount calculation Michael Peter Christen 2012-06-09 09:05:47 +02:00
  • 0746308bc2 only the metadata tables shall be able to use the tail cache Michael Peter Christen 2012-06-08 18:36:11 +02:00
  • 7ec9bef0c3 fix for OOM Michael Peter Christen 2012-06-08 17:14:09 +02:00
  • 41c02cb10e - less restrictions for usage of Table RAM copy - new limit to use the table copy (instead of flag): 400MB available. If less is available, then a copy is never used. If more is available, then it can be used if there is a remaining space of at least 200MB - flush caches more often: flush the Digest cache Michael Peter Christen 2012-06-08 12:48:25 +02:00
  • b8f56a9803 npe bugfix Michael Peter Christen 2012-06-08 10:20:43 +02:00
  • ea0dceb55d bugfix: do not switch off standard memory strategy when performing a forced GC PLEASE CHECK if your peer has standard memory switched on! Michael Peter Christen 2012-06-08 09:48:46 +02:00
  • dd14b19c26 lazy initialization of block rank table ... only normal web search uses this. When interactive search or location search is used, the block rank is switched off Michael Peter Christen 2012-06-08 09:41:29 +02:00
  • ba10caf89a lazy initialization of database tables Michael Peter Christen 2012-06-08 09:30:51 +02:00
  • 701b9a28a0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2012-06-08 09:16:16 +02:00
  • ab7107b34b fixed RWIProcess queue limits: now discovering hidden results for mass result retrieval Michael Peter Christen 2012-06-08 09:14:54 +02:00
  • 10c9c17d51 fixed handlemap spread factor and null iterator handling Michael Peter Christen 2012-06-08 09:13:41 +02:00
  • b0095c8d3c flush the compressor cache when a cleanup is done Michael Peter Christen 2012-06-07 19:42:33 +02:00
  • a61f44f9e4 lazy initialization of block rank table. this causes that the table is not initialized when there is no search is done. the effect is most strong if YaCy is started headless which causes no browser pop-up which otherwise would load the search page and therefore trigger the initialization of the table. Michael Peter Christen 2012-06-07 13:16:38 +02:00
  • c8bbd180e4 enhanced hint for debian package automatic update Michael Peter Christen 2012-06-07 12:36:26 +02:00
  • 9ad84c5e9f fix for NPE in PerformanceMemory Michael Peter Christen 2012-06-07 12:36:05 +02:00
  • 96e9d77270 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2012-06-06 20:13:28 +02:00
  • 00f2df1120 a variety of possible memory leak fixes Michael Peter Christen 2012-06-06 18:23:18 +02:00
  • d7eb18cdf2 accept also file names beginning with "file://" for crawl start from file. Michael Peter Christen 2012-06-06 14:27:18 +02:00
  • 8002fd2578 use less cache space since a large cache would cause more memory usage in index files. Michael Peter Christen 2012-06-06 14:17:42 +02:00
  • 3dd8376825 added automatic cleaning of cache if metadata and file database size is not equal. It might happen that these data is different because one of that caches is cleaned after a while or when it is too big. The metadata is then not cleaned, but now wiped after a checkup process at every application start. This should cause a bit less memory usage. Michael Peter Christen 2012-06-06 14:15:24 +02:00
  • d0ec8018f5 fixes for bad long computation Michael Peter Christen 2012-06-06 14:13:31 +02:00
  • 6bb07afcc3 accept also files with other file prefix; used to read 'foreign' cache files Michael Peter Christen 2012-06-06 13:36:10 +02:00
  • 96c8119b50 added GeoLocation / GeoPoint classes which uses less memory than Location/Coordinates and has initializers with correct order of lat,lon coordinates Michael Peter Christen 2012-06-06 12:57:42 +02:00
  • 461a0ce052 removed warnings Michael Peter Christen 2012-06-05 20:03:43 +02:00
  • 62ae9bbfda allow more POIs, get more at once Michael Peter Christen 2012-06-05 18:29:54 +02:00
  • 407fdf6968 more bug fixes and performance hacks for search process Michael Peter Christen 2012-06-05 15:04:23 +02:00
  • a1fe65b115 performance hacks Michael Peter Christen 2012-06-05 12:06:26 +02:00
  • 2fe207f813 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2012-06-04 23:44:38 +02:00
  • 5aee19daa4 added show from cache in search results (not yet finished) Michael Peter Christen 2012-06-04 23:44:26 +02:00
  • 5e562dcdb7 adopted vocabulary usage within anotation/naviagtion feature of search to new SimpleVocabulary class Michael Peter Christen 2012-06-04 23:43:30 +02:00
  • 514700291a moved Vocabulary to cora package (added in git 964406ad17) Michael Peter Christen 2012-06-04 23:41:36 +02:00
  • 0284a4d88f more fixes for double precision of coordinates Michael Peter Christen 2012-06-04 23:37:41 +02:00
  • 964406ad17 added concurrency enhancement to xml parser Michael Peter Christen 2012-06-04 23:35:56 +02:00
  • 240045cf7c fix for bad distance computation Michael Peter Christen 2012-06-04 16:33:16 +02:00
  • e0d8643226 - performance hacks - added log warnings in case that search processes run into time-out situations - better concurrency for Integer formatter (used a non-synchronized formatter before) - bugfix for search termination (a poison pill was missing) - added timeout parameters for search (again) -> target is, that they are never reached. Michael Peter Christen 2012-06-04 15:37:39 +02:00
  • 7a329465b3 using pre-compile pattern in blacklist; should enhance search speed Michael Peter Christen 2012-06-04 15:34:53 +02:00
  • cf79b6cee3 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2012-06-01 08:32:26 +02:00
  • 6e83b02b83 - bugfix for surrogate file reader - bugfix for location search: suppress empty search Michael Peter Christen 2012-06-01 00:08:31 +02:00
  • 9b4c699526 ehanced location search: - search request are now made using a map boundary - search results are only computed for the map boundary - the number of results is adopted to the results in the visible range - added a double-buffering for the search result markers - added a search query option for the search results: /radius/<lat>/<lon>/<radius> Michael Peter Christen 2012-05-31 22:39:53 +02:00
  • 434af404c1 - added double-buffering for search layers - added automatic zooming to search result to location search Michael Peter Christen 2012-05-31 14:05:36 +02:00
  • 4d9b2dc487 automatically zoom to result layer bounds Michael Peter Christen 2012-05-31 01:12:06 +02:00
  • 6b40803adf - show number of results in map search interface - transfer view radius within query Michael Peter Christen 2012-05-31 00:47:52 +02:00
  • 834dc6b263 store more data from interface access Michael Peter Christen 2012-05-31 00:47:07 +02:00
  • 1f48d1528b performance hacks Michael Peter Christen 2012-05-31 00:46:30 +02:00
  • c70aaccdc9 better location to generate a guid for rss messages Michael Peter Christen 2012-05-30 17:14:25 +02:00
  • 10da7335ea performance hack: use a hash cache for all hashes that are computed by a byte array. If this hash is used in a HashMap (which is very often the case) then this hack eliminates a lot of re-computations of the same hash. Michael Peter Christen 2012-05-30 16:59:13 +02:00
  • f8a0cf6d7c RSSMessages do not need a concurrent hash map -> removed overhead Michael Peter Christen 2012-05-30 16:44:03 +02:00
  • 07ca7e4dd1 enhanced RSS parsing by ensuring that it is parsed with a buffered input stream Michael Peter Christen 2012-05-30 16:40:37 +02:00
  • 7c1feefb28 introduced a default 10 second time-out in rwi normalization time uring search process to prevent endless deadlocks after a very long running search Michael Peter Christen 2012-05-30 16:26:05 +02:00
  • 8d997d55b6 better logging Michael Peter Christen 2012-05-30 15:47:35 +02:00
  • 65d37e6a20 only ASCII needed in seed bitflags Michael Peter Christen 2012-05-30 15:42:28 +02:00
  • a8778e9c47 npe fix Michael Peter Christen 2012-05-30 15:28:45 +02:00
  • 0f82fb3628 using double instead float for a better release ordering Michael Peter Christen 2012-05-30 15:28:20 +02:00
  • 43c2c6e588 better logging Michael Peter Christen 2012-05-30 15:27:45 +02:00
  • ae173f4674 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git sixcooler 2012-05-30 14:47:32 +02:00
  • 56087c1f23 bump to httpclient- httpcore-, httpmime- 4.2 sixcooler 2012-05-30 14:46:21 +02:00
  • 168a7aa213 Windows Installer: update to Oracle Java 7u4 PCA42 2012-05-30 10:03:50 +02:00
  • 1a6fab60e0 added node state to xml Michael Peter Christen 2012-05-30 09:32:25 +02:00
  • 20e0cc0822 fix for bad location evaluation Michael Peter Christen 2012-05-29 14:46:13 +02:00
  • 2823e33531 fixes for wrong port number in shell scripts Michael Peter Christen 2012-05-29 13:08:27 +02:00
  • 1ab3de0885 fixes to location search Michael Peter Christen 2012-05-29 12:43:14 +02:00
  • 7acd7e88b3 added all shell scripts in /bin to add also latest passwd.sh file Michael Peter Christen 2012-05-29 12:00:32 +02:00
  • f167a1c69f removed osmarender from yacysearch_location because that caused a javascript error Michael Peter Christen 2012-05-29 02:22:02 +02:00
  • 71c3163f3d - fixes to node identification - added link to node in network list - added marking of portal search node peers Michael Peter Christen 2012-05-29 01:38:54 +02:00
  • d1e9fe3db5 enhanced RootState icon Michael Peter Christen 2012-05-29 00:06:33 +02:00
  • 4d3cc02168 replaced old bzip2 library against better documented commons-compress package from http://commons.apache.org/compress/ Michael Peter Christen 2012-05-28 23:53:48 +02:00
  • ad222be7f8 added node state icon in network list Michael Peter Christen 2012-05-25 17:29:54 +02:00
  • eff7667554 fix for http://bugs.yacy.net/view.php?id=188 Michael Peter Christen 2012-05-25 16:21:44 +02:00
  • 638390930d another patch to fix the Crawler_p layout Michael Peter Christen 2012-05-25 15:56:21 +02:00
  • 3c2bec681f added a root node flag: identifies peers with short ping time Michael Peter Christen 2012-05-25 15:33:02 +02:00
  • c846e9ca14 redesign of the crawler monitor page: show crawled pages instead of queue of urls that shall be crawled Michael Peter Christen 2012-05-25 01:45:38 +02:00
  • 8b974905ee changed log-in text for all servlets with authentication: - added hint how to set the password using a shell script - added a shell script to change the password Michael Peter Christen 2012-05-24 13:24:31 +02:00
  • 16b21f7a5b Added more steering in Crawler_p.html interface Michael Peter Christen 2012-05-23 18:00:37 +02:00
  • acc19e190d hack against 100% cpu during crawl delete Michael Peter Christen 2012-05-23 15:45:07 +02:00
  • c15fcde1c8 add-on to latest commit Michael Peter Christen 2012-05-21 17:52:30 +02:00
  • cf47d94888 performance hack to parse numbers inside of substrings without actually generating a substring. This avoids the allocation of a String object ech time a substring is parsed. Should affect CPU load during RWI transmission. Michael Peter Christen 2012-05-21 13:40:46 +02:00
  • 7e0ddbd275 added a "fromCache" flag in Response object to omit one cache.has() check during snippet generation. This should cause less blockings Michael Peter Christen 2012-05-21 03:03:47 +02:00
  • 81737dcb18 removed stack trace from swf parser since we cant do anything there Michael Peter Christen 2012-05-21 02:27:06 +02:00
  • 7bf421b9dd - fixed image search page navigation - removed some deadlocks and ConcurrentModificationExceptions during DidYouMean collection Michael Peter Christen 2012-05-21 01:58:29 +02:00
  • 125d47b3c1 added more interruptions in DidYouMean because that was the cause for some blockings during search Michael Peter Christen 2012-05-21 00:59:41 +02:00
  • c6a09eab0b synchronization needed Michael Peter Christen 2012-05-21 00:58:29 +02:00
  • fb94b47b1a changed queue sizes to have less memory occupied during indexing Michael Peter Christen 2012-05-21 00:19:03 +02:00
  • 76157dc2c3 bugfix for http://bugs.yacy.net/view.php?id=173 Michael Peter Christen 2012-05-21 00:18:00 +02:00
  • 6696cb1313 bugfix: lookup of peernames no result for active peer in page IndexControlRWIs_p.html -> Transfer RWI to other Peer SeedDB.lookupByName searche for lowercase peerNames, while MapColumnIndex.getIndex uses peername as is in the keyset. Changed the index init to insert lowercase peer names as key reger 2012-05-20 05:25:16 +02:00
  • c6558cba08 more classification bugs Michael Peter Christen 2012-05-20 02:59:47 +02:00
  • 4298f00d2d fixed bad usage of given words Michael Peter Christen 2012-05-20 01:35:49 +02:00
  • 082831b9d6 search contentdom was checked in wrong way - fixed Michael Peter Christen 2012-05-20 01:23:02 +02:00