Commit Graph

  • bcc2e7cb5b Merge branch 'master' of https://github.com/yacy/yacy_search_server luc 2015-11-03 09:29:57 +01:00
  • 2fcf6f104c fix bzipParser recognition - Bzip2Inputstream checks magic byte itself to identify bz2 (leave it in input) - try to suppy fitting mime for parsing bz2 content reger 2015-11-03 03:35:01 +01:00
  • 745e97a575 Merge branch 'master' of https://github.com/yacy/yacy_search_server luc 2015-11-02 08:10:11 +01:00
  • a60b1fb6c2 differentiate api call getLocalPort() from getConfigInt() reger 2015-10-31 23:09:03 +01:00
  • 02afba730e fix detection of https port changed after set in System Admin reger 2015-10-31 22:53:59 +01:00
  • 11f3666660 increase use of pre.defined CATCHALL_QUERY string reger 2015-10-31 19:44:31 +01:00
  • a58ee49307 Optimize internal imagequery focus on using content_type to select images (in favor of url file extension) reger 2015-10-31 19:18:46 +01:00
  • b61f91f0d4 Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-30 18:47:42 +01:00
  • e90e1f165d Avoid returning an empty image when target encoding is not supported or when an error occured while encoding. luc 2015-10-30 16:20:28 +01:00
  • fc3294382e Updated javadocs for warning on target encoding format potential errors. luc 2015-10-30 16:19:05 +01:00
  • aa70ff4ff6 Corrected images alpha channel rendering luc 2015-10-30 05:18:16 +01:00
  • 2895ab552a Made ViewImagePerfTest extend ViewImageTest to ease automated image render tests luc 2015-10-29 23:24:39 +01:00
  • 5f1b1491d2 removed totalResults in json to prevent message 'Duplicate key "totalResults"' when parsing the json in JSONObject (json reference implementation) Michael Peter Christen 2015-10-29 18:41:58 +08:00
  • 4a03cf06e1 Corrected encoding extension arg parsing luc 2015-10-29 02:24:17 +01:00
  • 81f53fc83a upd readme.mediawiki min java version 1.7 reger 2015-10-26 22:19:20 +01:00
  • d223cf0ae4 adjust MediaWiki importer geo coordinate calculation - allow lat/long 0.xxx - south / west assignment include test class reger 2015-10-26 21:19:35 +01:00
  • 7e2723a894 Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-26 19:23:25 +01:00
  • 2b775d5be6 fix typo in WikiCode coordinate calculation reger 2015-10-25 19:38:42 +01:00
  • a2dcf64039 fix IndexImportMediawiki_p servlet's refresh header add url parameter to make sure no parameter are included in refresh url which could cause unwanted restart of import job reger 2015-10-25 05:41:25 +01:00
  • bbe9df2bb3 fix MediawikiImporter for bz2 dump skip reading bz2 file magicbyte to identify bz2 format as inputstream reset would be required. Common compress reads and checks the magicbytes internally and throws ioexception if wrong, making preread obsolete. reger 2015-10-25 03:06:15 +01:00
  • c6687dd560 fix a system.out to log.fine in bmpParser reger 2015-10-25 00:26:45 +02:00
  • c720b4c249 remove override of dynamicField coordinate_p in solr schema (coordinate_p is not a mandatory field as such doesn't need to be declared as schema.field) reger 2015-10-24 22:44:28 +02:00
  • e53c6bbd51 fix init of peer flags (remove hiding of ssl flag) reger 2015-10-24 19:36:33 +02:00
  • 301ba6131a Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-24 13:01:47 +02:00
  • ac034db8bc Merge branch 'master' of https://github.com/luccioman/yacy_search_server Michael Peter Christen 2015-10-24 11:22:35 +08:00
  • 8da20718aa Created a class to test ViewImage rendering against multiple image files. luc 2015-10-23 15:49:07 +02:00
  • ec04d27473 Corrected APNG test suite link name. luc 2015-10-23 14:12:00 +02:00
  • cbb84ba073 Detailed javadoc. luc 2015-10-23 13:57:24 +02:00
  • 70111876d2 Filled ViewImageTest.html with all remaining IANA image file formats. Added some links to test suites and specifications. luc 2015-10-23 12:27:52 +02:00
  • bfccb8db1c Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-22 20:39:57 +02:00
  • 826f14f37f fix unnececary set null of peer flags, causing reread remove obsolete version flags reger 2015-10-22 02:35:58 +02:00
  • a156fd65d0 Patch to manage render or load errors is still needed after highlight.js version upgrade. Updated patch for better behavior consistency between browsers. luc 2015-10-22 00:36:34 +02:00
  • cdbafe340e Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-21 08:42:36 +02:00
  • 37e28e0dd3 - Keep aspect ratio of images rendered directly by browser such as gif and svg. - Corrected quadratic rendering of landscape images with height smaller than maxHeight luc 2015-10-21 02:49:51 +02:00
  • 571609c208 upd javascript img viewerto highslide 4.1.13 reger 2015-10-21 02:14:04 +02:00
  • e2d00585e2 Display full size preview using ViewImage Servlet. luc 2015-10-20 01:17:37 +02:00
  • 74b0283d57 Added image preview error management. luc 2015-10-20 01:15:02 +02:00
  • 5902ce032e Corrected NullPointerException case when ImageIO reader is not found for image format. luc 2015-10-19 14:11:26 +02:00
  • f0b5bc93a3 remove obsolete yacy.init entry "secureHttps" not used anywhere reger 2015-10-19 03:47:28 +02:00
  • c4fa6d7bf5 upd to icu4j-56_1 reger 2015-10-19 01:06:51 +02:00
  • 5445f38070 upd to jetty 9.2.13.v20150730 reger 2015-10-19 00:53:10 +02:00
  • 6ca02ad577 upd httpclient-4.5.1, httpmime-4.5.1, httpcore-4.4.3, commons-compress-1.10 reger 2015-10-18 19:53:39 +02:00
  • c6495a5b62 add a log entry on parsing ajax crawling scheme snapshot (prev. commit 9252e36aeb) reger 2015-10-18 06:19:12 +02:00
  • 9252e36aeb implement ajax crawling scheme for ajax sites which adhere to the proposed use of hash-bangs to provide html content see freshly deprecated https://developers.google.com/webmasters/ajax-crawling/ Implementation improves parsing of the homepage (ajax page) which uses metatag "fragment" in header and parses supplied html snapshot instead of mostly empty ajax/scripted page. Implementation supports also hash-bang urls (url with anchor starting with ! like ...path#!hashfragment) but our crawler filters it (use of hash-bang is controversly discussed and proposal is deprecated, makes no sense to adjust the crawler, but as long as it is used by some sites the minor change/improvement in htmlparser is good for some time). Quick - how does it work - if metatag fragment with content "!" is found - htmlparser tries to get content of htmls snapshot (using a different url) - htmlparser returns 2 documents (original url and snapshot content - but using same original url) - after parsing result documents are joined (and stored to index containing content also from snapshot page... as the original ajax page contains typically no parseable html content) reger 2015-10-18 05:51:01 +02:00
  • d1ae999ef9 replaced HashMap with LinkedHashMap to preserve the object order Michael Peter Christen 2015-10-16 23:30:51 +02:00
  • 7d075a1d76 added log lines Michael Peter Christen 2015-10-16 23:30:04 +02:00
  • 092dac086e Merge branch 'master' of https://github.com/luccioman/yacy_search_server Michael Peter Christen 2015-10-16 23:22:30 +02:00
  • a44cc774d0 Merge branch 'master' of github.com:yacy/yacy_search_server Michael Peter Christen 2015-10-16 23:21:58 +02:00
  • 41c9215174 Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-16 21:45:23 +02:00
  • 7a64bebb86 init Recrawl job chunk size to max crawl loader during job start, to use some system preferences and allow injection of recrawl urls before queue is empty During recrawl the balancer hangs on the very last urls often on hosts with huge delay time, by allowing injection earlier progress is more balanced. Max number of injected crawl urls by recrawl job is 2 * max loader. reger 2015-10-16 03:05:39 +02:00
  • e7dab60ebd Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-15 19:54:42 +02:00
  • d6522fa4a2 Integrated haraldk/TwelveMonkeys library to first add TIF image format support. luc 2015-10-15 10:06:51 +02:00
  • e093fb228d Created a generic ViewImage performance render test. luc 2015-10-15 09:18:24 +02:00
  • 9244694e64 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-10-14 15:17:23 +02:00
  • 151ccd50a9 fix for image size field values (must be multi-valued) Michael Peter Christen 2015-10-14 15:16:16 +02:00
  • 3ad564e2e4 Created a ViewImage rendering performance measurement test. luc 2015-10-14 10:17:09 +02:00
  • 62e07a26a0 Refactoring : split into sub-functions to make it understanding and performance measurement easier. luc 2015-10-14 10:15:00 +02:00
  • b3f044072e Updated table headers and SVG file url for case sensitive OS. luc 2015-10-14 10:13:37 +02:00
  • ff963cbe23 Merge branch 'master' of https://github.com/yacy/yacy_search_server luc 2015-10-13 08:55:18 +02:00
  • c9937973e3 unescape MultiProtocolURL getAttributes() return values. use getAttributes() to get query parameters as clear text (w/o url encoding) use getSearchpartMap() to get in internal format (url encoded) reger 2015-10-13 02:43:18 +02:00
  • 6695e5cdd3 Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-12 21:37:04 +02:00
  • 10b0eb106f fix link target on iframe list in CrawlProfileEditor reger 2015-10-11 06:06:40 +02:00
  • 78e8c6f3e5 refactor special handling (static override) of SUPPORTED_EXTENSIONS/MIME_TYPES not used for genericImageParser reger 2015-10-11 01:23:52 +02:00
  • d54c5d310a add links with image extension not automatically to image links. With the wide spread use e.g. of Wikimedia the url file extension of links with image extension often point to html. reger 2015-10-10 23:49:58 +02:00
  • f5746b5490 Added ico and bmp sample pictures luc 2015-10-06 20:48:09 +02:00
  • baede48161 Added JPEG 2000 and FITS samples luc 2015-10-06 09:53:09 +02:00
  • 7c9d80c5d0 Added image formats and informations for each format. luc 2015-10-06 09:51:47 +02:00
  • 0431be8d6c Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-06 08:47:00 +02:00
  • 073ef730af Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luc 2015-10-06 08:23:32 +02:00
  • 5744342fec handle image preview for url w empty file extension fix of commit 688f7b2a5c reger 2015-10-06 04:13:04 +02:00
  • 82dd004260 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luc 2015-10-05 09:06:28 +02:00
  • 851e8f6c8a check jpeg file signature in genericImageParser to fail early without further object allocation if source is not a jpeg. reger 2015-10-05 01:58:31 +02:00
  • fb75fea446 use recrawljob w/o sort results by date This is a workaround for existing index (not fully reindexed) since intro of schema with docvalues to prevent solr exception causing recrawljob to fail with org.apache.solr.core.SolrCore java.lang.IllegalStateException: unexpected docvalues type NONE for field 'load_date_dt' (expected=NUMERIC). Use UninvertingReader or index with docvalues. reger 2015-10-04 05:43:40 +02:00
  • 3cbf86f295 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-10-04 05:43:25 +02:00
  • 23f6294a2d removed unused import Michael Peter Christen 2015-10-04 05:43:16 +02:00
  • 43c27aa550 upd to solr/lucene 5.3.1 reger 2015-10-03 23:20:33 +02:00
  • fd5a1dc297 upd to poi-3.13 reger 2015-10-03 21:43:41 +02:00
  • 839d710105 Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-10-03 11:09:16 +02:00
  • 0ae9297ca5 Created a html test page to check ViewImage rendering with different file formats. luc 2015-10-02 12:41:30 +02:00
  • 136e8f6fbd Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luc 2015-10-02 11:02:27 +02:00
  • 688f7b2a5c allow/display svg images in image results previews svg is not supported by awt but by most browser. Image content is delivered as received (without size adjustment) reger 2015-10-02 01:48:48 +02:00
  • d5330391de remove some unused var allocation in parser reger 2015-10-01 23:11:58 +02:00
  • 3d7dd9d3aa follow-up to latest commit: also flush the search cache if all crawls had been terminated. Michael Peter Christen 2015-10-01 13:21:28 +02:00
  • 225200194a every time a crawl is started, the user expects a different search result behaviour. This requires that the search cache is flushed for each crawl start. TODO: this should also be done if a crawl is terminated. Michael Peter Christen 2015-10-01 13:18:44 +02:00
  • c737ff235d in case that the include_string contains several entries including 1-char tokens and also more-than-1-char tokens, then remove the 1-char tokens to prevent that we are to strict. This will make it possible to be a bit more fuzzy in the search where it is appropriate. Michael Peter Christen 2015-10-01 13:09:33 +02:00
  • 8e555d79a3 add also 1-character tokens to the token list because that could be also searched for. A full-string search for a filename may fail if those 1-char tokens are omitted Michael Peter Christen 2015-10-01 13:03:22 +02:00
  • 1091e25f4c Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-09-30 20:37:42 +02:00
  • eb7989b17b Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luc 2015-09-30 08:43:58 +02:00
  • 7c82cd4415 add a end condition to svgParser for wrong content (if parser choosen just by file extension) reger 2015-09-29 22:57:33 +02:00
  • 82f4f221e9 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luc 2015-09-28 09:47:11 +02:00
  • 9c2cd7e87b Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-09-27 18:46:02 +02:00
  • b92d81b073 remove double caching of inputstream in ViewImage reger 2015-09-27 03:24:28 +02:00
  • c7c5e2dff9 fix old/obsolete solr dependency to stax delete obsolete jar reger 2015-09-27 00:17:42 +02:00
  • beed1c417e Add report profile with OWASP Dependency-Check to maven pom reger 2015-09-26 19:58:15 +02:00
  • 356d4d1301 remove rdfParser from init (current function identical with genericParser) reger 2015-09-26 17:30:34 +02:00
  • c647d899e3 add svgParser to parse metadate from svg images Reads document level included title and description and skips the graphic content to save bandwidth. svg metadata element is not interpreted - remove rdfParser from init (current function identical with genericParser) reger 2015-09-26 17:27:33 +02:00
  • bad34804fe optimize parseInt for <img> tag attribute parsing Performance better as using Numberformat.parse or parseInt(substring()) reger 2015-09-26 15:42:23 +02:00
  • 68c6d6ca7a Merge branch 'master' of https://github.com/yacy/yacy_search_server sixcooler 2015-09-24 20:20:14 +02:00
  • 3c31bf845f fix for latest merge Michael Peter Christen 2015-09-24 13:53:54 +02:00
  • 6ebc2451a9 Merge pull request #14 from luccioman/master Michael Peter Christen 2015-09-24 13:50:23 +02:00