Commit Graph

  • 0f80c978d6 Limit the number of initially previewed links in crawl start pages. luccioman 2017-06-17 09:33:14 +02:00
  • d2a4a27f52 Improved stream-oriented parsing entering conditions. luccioman 2017-06-17 09:26:37 +02:00
  • 32288a8999 Merge branch 'master' of https://github.com/yacy/yacy_search_server luccioman 2017-06-17 08:16:55 +02:00
  • e9b4b29f90 Limit scope of some local JavaScript variables. luccioman 2017-06-16 08:50:57 +02:00
  • 369b8e0e0b added json(p) endpoint for crawl start Michael Peter Christen 2017-06-16 08:44:40 +02:00
  • 83ba45ebae make nsis build script require java 8 reger 2017-06-16 06:31:45 +02:00
  • cf70081cfc update nsi installer java autodl bundleid to use jre-8u131 reger 2017-06-16 02:17:49 +02:00
  • 9220ccbec7 remove reference to velocityresponsewriter in solrconfig.xml; it is no longer part of solr-core api http://lucene.apache.org/solr/6_6_0/index.html reger 2017-06-16 00:12:09 +02:00
  • 4be4bfbba6 remove sample path setting in solrconfig.xml, not valid in YaCy, resulting in startup stop exception after fresh switch to 1.921 reger 2017-06-15 21:02:18 +02:00
  • 510859bcce update maven pom setting to YaCy version 1.921 java 1.8 and solr 6.6 reger 2017-06-15 20:24:53 +02:00
  • f6e8d71718 Prevent high CPU load at startup, caused by the Solr suggester build. luccioman 2017-06-15 14:13:46 +02:00
  • 9dd790087d Added HT Cache basic statistics (hit rate) luccioman 2017-06-15 09:50:02 +02:00
  • 5fdd5d16b1 Use volatile to ensure concurrent threads use up to date property value luccioman 2017-06-15 09:48:22 +02:00
  • 28b451a0b3 Made Cache compression level and lock timeout user configurable luccioman 2017-06-14 19:02:08 +02:00
  • a7394b479b Limit the synchronization blocking time on some Cache operations. luccioman 2017-06-14 09:13:50 +02:00
  • 73ab4a7b3a Prevent log pollution from unwanted Solr warnings. luccioman 2017-06-14 08:56:11 +02:00
  • c94a8c76bd re-added solr synchronization hack Michael Peter Christen 2017-06-09 12:50:36 +02:00
  • 6fe735945d migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8 Also: now Version 1.921 Michael Peter Christen 2017-06-09 12:25:23 +02:00
  • ce89492319 Ensure system resource release by closing document stream. luccioman 2017-06-08 07:36:11 +02:00
  • 8399275142 Properly close file output streams even in exception scenarios. luccioman 2017-06-08 07:19:16 +02:00
  • 4e4dc6c4e5 Removed unnecessary finalize implementation. luccioman 2017-06-06 10:30:02 +02:00
  • 632354e2ff Tokenize result entry keywords and add some styling for display reger 2017-06-04 01:50:40 +02:00
  • c42d17f607 upd to commons-compress-1.14.jar reger 2017-06-03 21:58:04 +02:00
  • a04feac064 Ensure file input streams proper closing in both success and failures luccioman 2017-06-03 04:00:46 +02:00
  • d98c04853d Ensure proper closing of file input streams. luccioman 2017-06-02 12:14:29 +02:00
  • c53c58fa85 Ensure closing ChunkIterator stream in every possible use case. luccioman 2017-06-02 09:47:45 +02:00
  • 29e52bda39 Merge branch 'master' of https://github.com/yacy/yacy_search_server luccioman 2017-06-02 01:47:53 +02:00
  • a9cb083fa1 Improved consistency between loader openInputStream and load functions luccioman 2017-06-02 01:46:06 +02:00
  • a814f3d885 Introduce keyword query parameter This enables keyword navigator to filter on keywords. Added search page output and layout config for keywords, allowing e.g. in Intranet use to display the keywords. No styling or links applied to the keyword text (but is desirable possibly in combination with bootstrap-tagsinput for future/intranet). reger 2017-06-02 01:00:21 +02:00
  • cbccf97361 Added JavaDoc to the getpageinfo_p API servlet. luccioman 2017-05-30 17:38:16 +02:00
  • c226ded799 Fix unescape of URLs having some '%' chars but not percent-encoded luccioman 2017-05-30 12:32:14 +02:00
  • bd88fd303e Deprecated duplicated and internally unused getpageinfo servlet. luccioman 2017-05-30 09:29:28 +02:00
  • 306a82dd71 Fixed scraper NullPointerException cases on malformed URLs. luccioman 2017-05-30 08:48:20 +02:00
  • aa55d71cf5 Fixed a NullPointerException case on Digest authentication. luccioman 2017-05-29 19:16:09 +02:00
  • b65a04087b upd to pdfbox-2.0.6.jar reger 2017-05-24 22:13:42 +02:00
  • 02ec0ed13c Quoted param value in Solr query to avoid unwanted traces in logs luccioman 2017-05-24 08:43:03 +02:00
  • 1be4d32f99 Restored search page default behavior for Tab, Page Up and Down keys luccioman 2017-05-23 07:25:40 +02:00
  • 1737af37cf Set request originator to own peer in warc importer in addition to change in 039162fbf0 reger 2017-05-22 01:56:11 +02:00
  • 039162fbf0 Change warc importer to use defaultsurrogate-crawl profile, as reported by LA_FORGE http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5990 and analysed by @luccioman (see comment 510f11d374): it creates a conflict using another crawlprofile without setting originator. reger 2017-05-22 01:34:08 +02:00
  • 3b1d640a3c enhanced debugging Michael Peter Christen 2017-05-18 00:28:12 +02:00
  • 7de7879f13 added a cache to prevent too many seed enumerations Michael Peter Christen 2017-05-18 00:28:00 +02:00
  • bd7411a53a Enable p2p and cluster communication when "Protection of all pages" on luccioman 2017-05-17 09:00:29 +02:00
  • 45346c1be8 Added missing accessibility attributes on search results progress bar. luccioman 2017-05-16 09:44:13 +02:00
  • 91a06bc669 Annotated search result information separators for screen readers. luccioman 2017-05-15 13:31:24 +02:00
  • 31ad043bb9 Added user interface feedback on results feeding termination status. luccioman 2017-05-15 13:15:16 +02:00
  • ff6392215e added closing of lst-Tag in solr-Export sgaebel 2017-05-13 20:38:25 +02:00
  • d90b001e1b Improved previous merge "Show ranking in HTML UI". luccioman 2017-05-11 18:02:33 +02:00
  • efe1232d90 Merge branch 'html-show-ranking' of https://github.com/JeremyRand/yacy_search_server luccioman 2017-05-11 14:53:57 +02:00
  • 0f0f42b509 Added some JavaDoc luccioman 2017-05-11 08:33:19 +02:00
  • 077d062be3 Adjust mergeDocuments to keep youngest last-modified date of document collection reger 2017-05-09 22:52:54 +02:00
  • 654801523e Fixed StringIndexOutOfBoundsException case. luccioman 2017-05-09 18:32:47 +02:00
  • b297f5bdbe Updated Debian package post install script admin password encoding. luccioman 2017-05-09 12:20:41 +02:00
  • 7623d7728f Fixed Debian install message misspelling. luccioman 2017-05-09 12:15:41 +02:00
  • 522a268305 Improved new blacklist entries URL scheme detection. luccioman 2017-05-04 16:36:45 +02:00
  • 532981b363 Updated putHTML() JavaDoc luccioman 2017-05-04 11:21:27 +02:00
  • 58d23047dd Handle '?' and '+' chars as valid wild cards when adding to blacklist. luccioman 2017-05-04 11:19:59 +02:00
  • 4564541b3b Fixed blacklist Regex containing '+' characters rendering. luccioman 2017-05-04 11:12:58 +02:00
  • 0612a8f4f2 Fixed the previously added link to scheduled dump operations. luccioman 2017-05-04 08:45:30 +02:00
  • a87281b498 Added MediaWiki dump import scheduling feature. luccioman 2017-05-03 18:53:01 +02:00
  • 10c03c6c64 Improved MediaWiki dump import monitoring. luccioman 2017-05-02 09:38:45 +02:00
  • edd7ccac40 Added some JavaDoc luccioman 2017-05-02 09:33:11 +02:00
  • 79fdf14b0a Fixed regression introduced by commit 9ad4d16 luccioman 2017-05-02 09:32:04 +02:00
  • 7678fd67e3 copied fix from yacy_grid_parser for wrong array type Michael Peter Christen 2017-05-01 11:44:26 +02:00
  • 200b100fb8 added patch to rewrite altered yacy grid schema into yacy schema Michael Peter Christen 2017-05-01 11:38:02 +02:00
  • 9ad4d16829 Add a responseHeader to the solr index export with a format identifier and export parameter (in accordance with response xml format) for easier format detection on import. reger 2017-04-30 23:53:52 +02:00
  • 9697209ef6 Fixed Index Export feature for compatibility with old indexed documents. luccioman 2017-04-28 11:39:51 +02:00
  • 88c062639b Added some JavaDoc luccioman 2017-04-28 11:36:48 +02:00
  • 8d288f5dba Crawl results page : apply table lines number limit. luccioman 2017-04-27 18:24:54 +02:00
  • 31fff2c986 Extended WikiCode template inclusion syntax support. luccioman 2017-04-27 09:50:04 +02:00
  • 973d74712f added yacy grid flatjson surrogate parser Michael Peter Christen 2017-04-25 08:44:02 +02:00
  • b1da92648e Fixed surrogates import monitoring page (/CrawlResults.html?process=7) luccioman 2017-04-24 18:24:26 +02:00
  • 527d494c1a Fixed "Unchecked conversion" compilation warnings. luccioman 2017-04-24 13:27:07 +02:00
  • 2b03e40134 upd to jwat-1.0.5 reger 2017-04-22 23:32:40 +02:00
  • 7a7da698d4 fix unit test MultiProtocolURL(file) assertion for Windows path with drive letter. reger 2017-04-20 00:47:52 +02:00
  • c77e43a391 Take out mailto collect in internal parsed document As earlier plans to make use of mailto as separate webgraph entity didn't materialize (see http://forum.yacy-websuche.de/viewtopic.php?f=8&t=5726&p=32493&hilit=mailto#p32493) free the unused handling and resources. reger 2017-04-20 00:18:18 +02:00
  • 335868edba Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2017-04-17 12:26:27 +02:00
  • bec34d3546 Add url input field as source for WarcImporter allowing to import warc from url without prior download. reger 2017-04-16 04:25:29 +02:00
  • d3df8a46c4 fix unresolved_pattern on missing post parameter api/message.html reger 2017-04-14 21:14:26 +02:00
  • f66438442e Extended Mediawiki dump import to remote URLs. luccioman 2017-04-14 14:32:44 +02:00
  • e5c3b16748 Improved http client close time on stream processing errors. luccioman 2017-04-14 14:23:50 +02:00
  • 23775e76e2 Fixed endless loop case in wikicode processing. luccioman 2017-04-12 17:17:03 +02:00
  • 0bc868a819 Improved support for non ASCII chars in local file system URLs luccioman 2017-04-12 09:23:10 +02:00
  • 7edddd7b0d Improved error reports on various wiki dump prerequisites failure cases. luccioman 2017-04-11 08:21:34 +02:00
  • dfe8d4139b Used a text input for wiki dump import file selection. luccioman 2017-04-11 07:34:17 +02:00
  • 3a71430030 Adjust ConfigSearchPage_p to activated hosts navigator as plugin reger 2017-04-10 22:58:20 +02:00
  • 7b80189bda Activate hosts navigator plugin. This includes rwi results in the navigator count. This might be tangentially related to http://mantis.tokeek.de/view.php?id=736 as the example includes a local index search, while rwi results are not counted. reger 2017-04-10 22:42:06 +02:00
  • 05a1b14b4a add missing text from ConfigRobotsTxt_p to master.lng and link to Translation Editor to Translation News page. reger 2017-04-09 21:42:05 +02:00
  • a39c00a93f add servlet to list users in UserDB and made user editor available in separate servlet for a quick and easy overview of configured users and selection for edit. reger 2017-04-09 02:09:32 +02:00
  • a4498e17c0 fix edit current user form to use required post method introduced with cde237b687 reger 2017-04-08 22:54:57 +02:00
  • f5ad29edb1 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2017-04-07 09:15:15 +02:00
  • 76e9135526 added flatjson parser (stub, unfinished) Michael Peter Christen 2017-04-07 09:15:05 +02:00
  • 46a4aaf09c upd to Solr-5.5.4 reger 2017-04-06 21:18:01 +02:00
  • b7417ac329 Introduce a Keyword search navigator using the index field keywords. The keywords field string is split into words as navigator entries. reger 2017-04-05 00:08:25 +02:00
  • eddb7a9804 upd to pdfbox-2.0.5.jar and transient dependency xmpcore-5.1.3.jar required by metadata-extractor-2.10.1 (fix build.xml compiler warning) reger 2017-04-04 00:59:26 +02:00
  • 27884da1ff add CookieTest_p.html text to master.lng reger 2017-04-03 22:53:07 +02:00
  • 665d087d76 Enforced access controls on a few more administration pages. luccioman 2017-04-03 12:20:16 +02:00
  • 0feded21dd Escaped potentially active HTML content from recorded API call comments. luccioman 2017-04-03 11:40:37 +02:00
  • 09e72eb0a4 Set Config Portal as a private administration page. luccioman 2017-04-03 11:34:49 +02:00
  • c19d60f06b update master.lng with recent text changes to IndexExport_p.html, IndexImportWarc_p.html reger 2017-04-02 22:30:23 +02:00
  • 9339a6a4c5 use css error class for error msg in IndexImportOAIPMH_p.html, adjust to xhtml <p> usage rule reger 2017-04-02 20:36:22 +02:00