Commit Graph

  • 7de7879f13 added a cache to prevent too many seed enumerations Michael Peter Christen 2017-05-18 00:28:00 +02:00
  • bd7411a53a Enable p2p and cluster communication when "Protection of all pages" on luccioman 2017-05-17 09:00:29 +02:00
  • 45346c1be8 Added missing accessibility attributes on search results progress bar. luccioman 2017-05-16 09:44:13 +02:00
  • 91a06bc669 Annotated search result information separators for screen readers. luccioman 2017-05-15 13:31:24 +02:00
  • 31ad043bb9 Added user interface feedback on results feeding termination status. luccioman 2017-05-15 13:15:16 +02:00
  • ff6392215e added closing of lst-Tag in solr-Export sgaebel 2017-05-13 20:38:25 +02:00
  • d90b001e1b Improved previous merge "Show ranking in HTML UI". luccioman 2017-05-11 18:02:33 +02:00
  • efe1232d90 Merge branch 'html-show-ranking' of https://github.com/JeremyRand/yacy_search_server luccioman 2017-05-11 14:53:57 +02:00
  • 0f0f42b509 Added some JavaDoc luccioman 2017-05-11 08:33:19 +02:00
  • 077d062be3 Adjust mergeDocuments to keep youngest last-modified date of document collection reger 2017-05-09 22:52:54 +02:00
  • 654801523e Fixed StringIndexOutOfBoundsException case. luccioman 2017-05-09 18:32:47 +02:00
  • b297f5bdbe Updated Debian package post install script admin password encoding. luccioman 2017-05-09 12:20:41 +02:00
  • 7623d7728f Fixed Debian install message misspelling. luccioman 2017-05-09 12:15:41 +02:00
  • 522a268305 Improved new blacklist entries URL scheme detection. luccioman 2017-05-04 16:36:45 +02:00
  • 532981b363 Updated putHTML() JavaDoc luccioman 2017-05-04 11:21:27 +02:00
  • 58d23047dd Handle '?' and '+' chars as valid wild cards when adding to blacklist. luccioman 2017-05-04 11:19:59 +02:00
  • 4564541b3b Fixed blacklist Regex containing '+' characters rendering. luccioman 2017-05-04 11:12:58 +02:00
  • 0612a8f4f2 Fixed the previously added link to scheduled dump operations. luccioman 2017-05-04 08:45:30 +02:00
  • a87281b498 Added MediaWiki dump import scheduling feature. luccioman 2017-05-03 18:53:01 +02:00
  • 10c03c6c64 Improved MediaWiki dump import monitoring. luccioman 2017-05-02 09:38:45 +02:00
  • edd7ccac40 Added some JavaDoc luccioman 2017-05-02 09:33:11 +02:00
  • 79fdf14b0a Fixed regression introduced by commit 9ad4d16 luccioman 2017-05-02 09:32:04 +02:00
  • 7678fd67e3 copied fix from yacy_grid_parser for wrong array type Michael Peter Christen 2017-05-01 11:44:26 +02:00
  • 200b100fb8 added patch to rewrite altered yacy grid schema into yacy schema Michael Peter Christen 2017-05-01 11:38:02 +02:00
  • 9ad4d16829 Add a responsHeader to the solr index export with a format identifier and export parameter (in accordance with response xml format) for easier format detection on import. reger 2017-04-30 23:53:52 +02:00
  • 9697209ef6 Fixed Index Export feature for compatibility with old indexed documents. luccioman 2017-04-28 11:39:51 +02:00
  • 88c062639b Added some JavaDoc luccioman 2017-04-28 11:36:48 +02:00
  • 8d288f5dba Crawl results page : apply table lines number limit. luccioman 2017-04-27 18:24:54 +02:00
  • 31fff2c986 Extended WikiCode template inclusion syntax support. luccioman 2017-04-27 09:50:04 +02:00
  • 973d74712f added yacy grid flatjson surrogate parser Michael Peter Christen 2017-04-25 08:44:02 +02:00
  • b1da92648e Fixed surrogates import monitoring page (/CrawlResults.html?process=7) luccioman 2017-04-24 18:24:26 +02:00
  • 527d494c1a Fixed "Unchecked conversion" compilation warnings. luccioman 2017-04-24 13:27:07 +02:00
  • 2b03e40134 upd to jwat-1.0.5 reger 2017-04-22 23:32:40 +02:00
  • 7a7da698d4 fix unit test MultiProtocolURL(file) assertion for Windows path with drive letter. reger 2017-04-20 00:47:52 +02:00
  • c77e43a391 Take out mailto collect in internal parsed document As earlier plans to make use of mailto as separate webgraph entity didn't materialize (see http://forum.yacy-websuche.de/viewtopic.php?f=8&t=5726&p=32493&hilit=mailto#p32493) free the unused handling and resources. reger 2017-04-20 00:18:18 +02:00
  • 335868edba Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2017-04-17 12:26:27 +02:00
  • bec34d3546 Add url input field as source for WarcImporter allowing to import warc from url without prior download. reger 2017-04-16 04:25:29 +02:00
  • d3df8a46c4 fix unresolved_pattern on missing post parameter api/message.html reger 2017-04-14 21:14:26 +02:00
  • f66438442e Extended Mediawiki dump import to remote URLs. luccioman 2017-04-14 14:32:44 +02:00
  • e5c3b16748 Improved http client close time on stream processing errors. luccioman 2017-04-14 14:23:50 +02:00
  • 23775e76e2 Fixed endless loop case in wikicode processing. luccioman 2017-04-12 17:17:03 +02:00
  • 0bc868a819 Improved support for non ASCII chars in local file system URLs luccioman 2017-04-12 09:23:10 +02:00
  • 7edddd7b0d Improved error reports on various wiki dump prerequisites failure cases. luccioman 2017-04-11 08:21:34 +02:00
  • dfe8d4139b Used a text input for wiki dump import file selection. luccioman 2017-04-11 07:34:17 +02:00
  • 3a71430030 Adjust ConfigSearchPage_p to activated hosts navigator as plugin reger 2017-04-10 22:58:20 +02:00
  • 7b80189bda Activate hosts navigator plugin. This includes rwi results in the navigator count. This might be tangential related to http://mantis.tokeek.de/view.php?id=736 as the example includes a local index search, while rwi results are not counted. reger 2017-04-10 22:42:06 +02:00
  • 05a1b14b4a add missing text from ConfigRobotsTxt_p to master.lng and link to Translation Editor to Translation News page. reger 2017-04-09 21:42:05 +02:00
  • a39c00a93f add servlet to list user in UserDB and made user editor available in separate servlet for a quick and easy overview of configured user and selection for edit. reger 2017-04-09 02:09:32 +02:00
  • a4498e17c0 fix edit current user form to required post mehtod introduced with cde237b687 reger 2017-04-08 22:54:57 +02:00
  • f5ad29edb1 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2017-04-07 09:15:15 +02:00
  • 76e9135526 added flatjson parser (stub, unfinished) Michael Peter Christen 2017-04-07 09:15:05 +02:00
  • 46a4aaf09c upd to Solr-5.5.4 reger 2017-04-06 21:18:01 +02:00
  • b7417ac329 Introduce a Keyword search navigator using the index field keywords. The keywords field string is split into words as navigator entries. reger 2017-04-05 00:08:25 +02:00
  • eddb7a9804 upd to pdfbox-2.0.5.jar and transient dependency xmpcore-5.1.3.jar required by metadata-extractor-2.10.1 (fix build.xml compiler warning) reger 2017-04-04 00:59:26 +02:00
  • 27884da1ff add CookieTest_p.html text to master.lng reger 2017-04-03 22:53:07 +02:00
  • 665d087d76 Enforced access controls on a few more administration pages. luccioman 2017-04-03 12:20:16 +02:00
  • 0feded21dd Escaped HTML eventually active content from recorded API call comments. luccioman 2017-04-03 11:40:37 +02:00
  • 09e72eb0a4 Set Config Portal as a private administration page. luccioman 2017-04-03 11:34:49 +02:00
  • c19d60f06b update master.lng with recent text changes to IndexExport_p.html, IndexImportWarc_p.html reger 2017-04-02 22:30:23 +02:00
  • 9339a6a4c5 use css error class for error msg in IndexImportOAIPMH_p.html, adjust to xhtml <p> usage rule reger 2017-04-02 20:36:22 +02:00
  • 777cb5b812 remove test case for Standard_MemoryControl which will always fail see https://github.com/yacy/yacy_search_server/pull/114 reger 2017-04-02 03:59:37 +02:00
  • ba339a2a45 Add servlet to import warc file from filesystem IndexImportWarc_p.html. Apply Importer interface to WarcImporter reger 2017-04-02 03:32:21 +02:00
  • 1d81b8f102 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2017-04-01 01:04:27 +02:00
  • 69081bce00 added export to elasticsearch. The export dump can easily be imported to elasticsearch using the command curl -XPOST localhost:9200/collection1/yacy/_bulk --data-binary @yacy_dump_XXX.flatjson Michael Peter Christen 2017-04-01 01:04:17 +02:00
  • 510f11d374 Implement surrogate import from Warc archives (as first option handle warc = Web ARChive File Format. Warc files with extension .warc or compressed warc.gz can be placed in the DATA/surrogate/in and contained responses are imported to the index. The used library is stream based so we can easily extend it later to use and load warc's from the net. reger 2017-03-31 00:58:11 +02:00
  • 5b5b9d5d96 URL Viewer : only display the link to metadata when metadata exists luccioman 2017-03-30 16:14:22 +02:00
  • 4b649b0a11 Fixed NPE case and API URL link on Solr HTML output for webgraph core. luccioman 2017-03-30 15:41:14 +02:00
  • 39ffa42a3c Modified RWI settings page radio click event to use HTTP POST luccioman 2017-03-30 10:23:47 +02:00
  • af28a07780 Updated API calls recording/replay with recent changes. luccioman 2017-03-30 09:22:28 +02:00
  • 1ccc44e681 fix default/httpd.mime Z file extension to lower case + test case reger 2017-03-26 23:52:31 +02:00
  • 44a9a580e3 remove seedlist bootstrap target (not working for some longer time) reger 2017-03-26 23:26:40 +02:00
  • c16498305b Add label text for search word statistic (AccessTracker_p.html) to master lng file reger 2017-03-26 23:13:12 +02:00
  • 81670c3484 One more use of SwitchboardConstants.SERVER_PORT constant, apply standard servlet design pattern initialization of solrselectservlet reger 2017-03-26 20:05:48 +02:00
  • cde237b687 Enforced access controls on some administrative actions. luccioman 2017-03-26 11:48:00 +02:00
  • df5970df6d Extended Apache HTTP Digest Auth. for use of YaCy encoded password luccioman 2017-03-26 11:29:04 +02:00
  • 40403942db Updated dump/restore shell scripts : the API is now IndexExport_p.html luccioman 2017-03-26 10:59:04 +02:00
  • 29e5110627 Updated shell scripts to be compatible with HTTP Digest authentication luccioman 2017-03-21 17:15:01 +01:00
  • bdadbda5fa Update master lng file with added text in Settings_ServerAccess remove outdated file entry in fr.lng & sk.lng reger 2017-03-21 01:16:16 +01:00
  • 1537157839 adjusted .travis.yml to build in libbuild first (see http://mantis.tokeek.de/view.php?id=545); added test of build instructions Karl-Philipp Richter 2016-10-25 05:06:42 +02:00
  • c55d526cb8 Add hint how to build with maven (for the first time) to readme reger 2017-03-20 02:33:21 +01:00
  • cbf58d5f0a Add hint text to default ServerAcess Port Settings page reger 2017-03-19 21:45:33 +01:00
  • f05976c017 Display the local search word statistic in alphabetic order reger 2017-03-19 07:12:35 +01:00
  • 3dd23c178b Introduce the option to configure a shutdown port. A port value of -1 will disable this option. reger 2017-03-19 02:30:08 +01:00
  • c4d5f1fc54 upd to slf4j-1.7.24.jar reger 2017-03-18 20:32:53 +01:00
  • c4b90eae98 upd to icu4j-58_2.jar reger 2017-03-18 20:06:58 +01:00
  • a2afb4bae0 add switchboardconstants for server ports config keys reger 2017-03-18 20:02:26 +01:00
  • e0c5b28331 update to jsoup-1.10.2.jar reger 2017-03-17 02:19:33 +01:00
  • 5b5ada38c3 update to jsch-0.1.54.jar reger 2017-03-17 02:07:02 +01:00
  • 038b9cd98e update translation for ConfigNetwork_p.html reger 2017-03-15 22:36:53 +01:00
  • f7fce1baad make digest default authentication in defaults/web.xml reger 2017-03-15 01:39:15 +01:00
  • 56d0a87a83 remove double occuance of geo:lat in rss tokens reger 2017-03-13 03:08:44 +01:00
  • 882d99dae4 upd to metadata-extractor-2.10.1.jar reger 2017-03-13 00:34:40 +01:00
  • b4fa1141b8 implement RequestHeader getRequestURI, getRequestURL for legacy request reger 2017-03-12 01:54:56 +01:00
  • 209a7374bd remove unused import pdfParser reger 2017-03-09 22:57:51 +01:00
  • de1c1c16db Improve pdf text extraction resource handling. For sort pdf <= 3 pages use already extracted content, only for long pdf > 3 pages reassign content and close internal writer (to direct free buffers) reger 2017-03-09 22:56:33 +01:00
  • 52c9d0c858 upd to pdfbox-2.0.4.jar reger 2017-03-09 22:50:19 +01:00
  • 9b6d1abd9e eliminate some compiler unchecked and deprecation warnings in nav plugins by explicite type declaration and replacing date.getYear with Calendar.get reger 2017-03-09 01:42:36 +01:00
  • 6eb7d27449 upd to httpclient v4.5.3 reger 2017-03-08 22:35:48 +01:00
  • 8e77fe3860 Fixed unresolved pattern case in search results progress bar. luccioman 2017-03-08 10:27:18 +01:00
  • 79df5bb20a Fixed settingsAck_p.html back link for case where referrer is stripped. luccioman 2017-03-07 12:27:27 +01:00