Commit Graph

  • c996b04741 HTML validation : fixed URL encoding of search results action links. luccioman 2016-10-12 09:16:47 +02:00
  • 2b81703828 Refactored search result action links construction. luccioman 2016-10-12 08:45:32 +02:00
  • e68b00678e prevent negative score on URIMetadataNode - in the special case where no solr score is supplied. + assert before use & test case reger 2016-10-11 19:54:50 +02:00
  • 242707f9b4 Fixed loadFromCache with strategy IFFRESH. luccioman 2016-10-10 01:10:35 +02:00
  • c778219768 remove module for swfparser from maven parent pom no longer required for the build see a4465c97d6 reger 2016-10-07 23:49:03 +02:00
  • 094aed8664 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luccioman 2016-10-07 11:06:34 +02:00
  • c7402a2f89 Removed invalid empty form action. luccioman 2016-10-07 10:57:31 +02:00
  • 37df2e19fd Removed xmlns attribute which no longer makes sense in HTML5 pages. luccioman 2016-10-07 10:46:20 +02:00
  • 94924e288f Added some accessibility improvements to the main interface. luccioman 2016-10-07 10:44:45 +02:00
  • dd86f7c44e Fixed HTML validation errors and grouped radios options in fieldsets luccioman 2016-10-07 10:43:06 +02:00
  • fc0c72c84b Switched to the short HTML Doctype luccioman 2016-10-07 10:42:23 +02:00
  • 7c81160f45 correct blacklist export as text url to blacklists_p.txt was using servlet for network access and missing network.unit.name fix for http://mantis.tokeek.de/view.php?id=694 + prevent unresoved_pattern in yacy/list servlet reger 2016-10-07 03:03:41 +02:00
  • b752bcfecb adjust date in text detection to ignore some program version strings like "3.1.2.0102" see http://mantis.tokeek.de/view.php?id=650 + expand test case reger 2016-10-06 23:37:12 +02:00
  • b017e97421 optimize condenser language detection a little. langdetect probabilities take letter case into account, add words from description and anchors etc. as is. + add it to javadoc reger 2016-10-06 19:03:52 +02:00
  • ae3717d087 adjust Tokenizer sentence count to ignore repeated punctuation (like !!!! ) + remove unused sentenceword map (we use only the count) + upd test case for sentence count reger 2016-10-06 03:41:07 +02:00
  • b5eb7a9217 Removed unnecessary crawlingDomFilterDepth hidden field. luccioman 2016-10-05 13:48:22 +02:00
  • f6d7c6ee1f Fixed Recorded action URLs beginning displayed in /Table_API_p.html luccioman 2016-10-05 12:20:37 +02:00
  • 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence + upd test case for rwi multi-word-query (leaving results known to fail untested) reger 2016-10-05 05:52:37 +02:00
  • 4963ecb0a0 Add preference (disabled by default) to show the ranking for each result on the HTML UI. JeremyRand 2016-10-04 11:49:16 +00:00
  • 34658ddb9b Merge pull request #76 from luccioman/crawler luccioman 2016-10-04 05:06:18 +02:00
  • 0065c9b9ea Crawl monitoring : refresh running crawls table luccioman 2016-10-04 03:55:49 +02:00
  • e1e632ad84 Switched to the short HTML Doctype luccioman 2016-10-03 21:57:02 +02:00
  • 4d8611e5e7 Tables accessibility : added missing <thead> sections. luccioman 2016-10-03 21:55:38 +02:00
  • 9fb3142317 Restricted variables scope to function handleStatus() in Crawler.js luccioman 2016-10-03 21:52:24 +02:00
  • 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov + upd unknown ant script with current lib/jsch version reger 2016-10-04 03:05:26 +02:00
  • 681a61dafb adjust rwi index result word position handling used for rwi ranking - correct WordReferenceVars.toRowEntry posintext parameter to set expected min posintext (the difference is on multi-word queries, while positions are ordered by search word order). - modified posofphrase/posinphrase join operation - to set min posofphrase - and keep posinphrase if not same posofphrase (was set to 0, no differentiation during ranking) + fix compiler msg (missing type declaration) reger 2016-10-04 01:42:18 +02:00
  • 14f7577231 add support for older Word versions (Word6/Word95) to docParser reger 2016-10-03 01:52:51 +02:00
  • 8794e06721 upd to poi-3.15.jar reger 2016-10-03 01:48:35 +02:00
  • e25f2ee88b mention date search parameter in search option help (index.html) reger 2016-10-02 06:36:34 +02:00
  • 1a79c64495 generalize DateDetection with holiday date rules readily available in icu to make sure current dates are recognized (was fixed to 2014 - 2016) + adjust holiday date parser from pattern.match to pattern.find to deal with leading and trailing text + moved relative date recognition (morgen, tomorrow) to parseline (used by query parser only), as not working and problematic for indexing + add test case for parseline (used by query parser) reger 2016-10-02 03:19:12 +02:00
  • 6f68f08354 correct DateDetection Silvester date add Thanksgiving reger 2016-10-01 03:16:27 +02:00
  • 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element, a) required b) prevent NPE in rss servlets + add test reger 2016-09-30 21:46:57 +02:00
  • fedb9f8151 del double entry in master.lng reger 2016-09-30 21:42:42 +02:00
  • 8d57b5b970 Added some javadocs. luccioman 2016-09-30 17:12:55 +02:00
  • 4585a60d7e Made use of the constant corresponding to the hard-coded value. luccioman 2016-09-30 17:12:29 +02:00
  • 60df09fff9 Fixed some HTML validation errors : Illegal character in query luccioman 2016-09-30 10:54:53 +02:00
  • a76a46a2e9 Removed invalid rel="[count]" from links in tagcloud. luccioman 2016-09-30 09:43:51 +02:00
  • 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet (as replacement for the rarely used "docs" rss-tag for a url to the rss-specification) reger 2016-09-29 23:59:10 +02:00
  • 5027912f30 Fixed <p> spacers : blocks elements such as <div> are not allowed inside luccioman 2016-09-29 14:24:15 +02:00
  • abe489a0b5 Removed unnecessary ARIA "form" role on native HTML form elements. luccioman 2016-09-29 13:42:07 +02:00
  • cca4186044 Fixed HTML validation error : "Stray end tag div" luccioman 2016-09-29 11:42:59 +02:00
  • dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200 luccioman 2016-09-29 10:33:11 +02:00
  • ada473ced2 fix ConfigBasic servlet parameter name for Japanese _jp->_ja reger 2016-09-28 16:08:36 +02:00
  • d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git luccioman 2016-09-28 14:53:08 +02:00
  • b8f6458152 Prevent yacy main thread from hanging on browser opening process. luccioman 2016-09-28 14:52:30 +02:00
  • cf3a4bdf52 upd to pdfbox-2.0.3 reger 2016-09-27 23:12:10 +02:00
  • 70e1eb30a5 prevent StringIndexOutOfBounds in getLocalFile() + tighten patching of DOS path w/o protocol to drive "LETTER": reger 2016-09-27 22:40:36 +02:00
  • 1bb0b135ac Avoid duplication of various MS Windows file URLs flavors luccioman 2016-09-27 07:53:08 +02:00
  • b9a8476f02 Removed unused import luccioman 2016-09-27 07:41:45 +02:00
  • e73c1eea8c remove unused rootpattern, leftover from commit 9a5ab4e2c1 (diff-d2b184283abed53ae260fc9eabdaef40) reger 2016-09-26 02:54:58 +02:00
  • 6f8c3ccea4 improve url hash computation for file path with mixed java & windows file.separator to compute equal hashes (by normalizing path for computation) + expand test case for to check mixed java / windows file url notation like e.g. file:///c:/test/file.html vs. file:///c:\test/file.html - relates partially to http://mantis.tokeek.de/view.php?id=692 reger 2016-09-25 22:08:12 +02:00
  • bac302bfe4 fix NPE in QuickCrawlLink_p if param doesn't contain crawl url reger 2016-09-24 23:33:21 +02:00
  • e9b9a7f68f add missing text for Supporter.html to master.lng reger 2016-09-24 05:02:37 +02:00
  • efcb6a1e74 fix supported mime XML -> xml for rssParser (mime normalized to lower case for comparison) + add mime text/xml as in use for rss in the wild reger 2016-09-23 23:37:12 +02:00
  • b5ba8f9f68 Added alternative text and title to HostBrowser.html image links luccioman 2016-09-23 13:27:46 +02:00
  • 4aba491156 Fixed HTML validation errors : duplicate ids. luccioman 2016-09-22 16:25:47 +02:00
  • 1c139d70d4 Fixed W3C validation error : percent encode '[' and ']' chars in hrefs. luccioman 2016-09-22 16:20:13 +02:00
  • b3b75b0498 Accessibility : add a customizable alternative text to YaCy log luccioman 2016-09-22 16:08:33 +02:00
  • f2bc1b268d Updated URL fragment validation rules according to current standards luccioman 2016-09-22 11:28:33 +02:00
  • b1b8e69da8 Fixed NullPointerException cases luccioman 2016-09-22 11:25:33 +02:00
  • 3ee4f56c39 Improved ErrorCache behavior when switching networks luccioman 2016-09-22 09:07:07 +02:00
  • 7d5ba2afa4 Added some JavaDoc and moved crawlStacker close at the right place. luccioman 2016-09-22 08:21:14 +02:00
  • 8edbcd8ad4 Log eventual Solr instances close errors. luccioman 2016-09-22 08:20:01 +02:00
  • 330768c8a2 fix for solr write.lock after mode change http://mantis.tokeek.de/view.php?id=686 The embedded core holds a lock on the index and must be closed. Earlier commit comment states that core should be closed with solr instance instead on close of connector. Adjusted the InstanceMirror.close() to take care of closing the embedded instance to release the lock. In 2 routines of fulltext this was already explicitly implemented (disconnectLocalSolr). Now this disconnect is part of the InstanceMirror.close(). reger 2016-09-22 00:16:22 +02:00
  • 11786457b7 add test case for EmbeddedSolrConnector close() for issue http://mantis.tokeek.de/view.php?id=686 (without solving the issue here) reger 2016-09-21 21:08:21 +02:00
  • dc25aee84a fix master.lng entry with targetlang + add text to api/citation.html reger 2016-09-21 01:30:58 +02:00
  • 52e56025f7 Fixed undefined error case in sorttable.js luccioman 2016-09-20 02:44:28 +02:00
  • 585d2a6441 test case: for NewsPool to check the id modificator (for unique id) and observe the distribution order .. hands on. + add test/DATA to gitignore reger 2016-09-20 01:55:56 +02:00
  • de5c873e38 Removed unused JavaScript file docs.min.js luccioman 2016-09-20 00:17:42 +02:00
  • df51e4ef07 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2016-09-19 11:01:58 +02:00
  • e063aaf97f enable fuzzy search, solr style (append a ~ to get a fuzziness on the word) Michael Peter Christen 2016-09-19 11:01:39 +02:00
  • ff6589fc0f test case: simulating multi word query for local rwi index Purpose of the test case is to be able to (controlled) analyse the rwi ranking for multi word searches (with focus on posintext and word-distance ranking) reger 2016-09-18 00:59:27 +02:00
  • e990297d2e avoid NPE on hello message with missing "yourip" key http://mantis.tokeek.de/view.php?id=684 reger 2016-09-15 23:26:25 +02:00
  • e51ab8c7aa hack to generate a unique message-id for messages created in the same second by optionally add a 1 second offset counter to the current time (which is used as the unique id part) reger 2016-09-15 02:59:32 +02:00
  • b82300358a removed version number check because it does not work any more if version numbers are expressed in a different way as we expect. That could cause that YaCy does not run on systems which are appropriate but we simply do not understand the version string. Michael Peter Christen 2016-09-14 16:32:57 +02:00
  • 2107674999 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git Michael Peter Christen 2016-09-14 16:24:55 +02:00
  • 0d28f563f4 fix for java version "9-ea" Michael Peter Christen 2016-09-14 16:24:32 +02:00
  • 24a071e114 Updated compiling section in Readme luccioman 2016-09-14 09:31:45 +02:00
  • 3b694b3935 add some javadoc to rwi wordreference distance, position to remember facts for http://mantis.tokeek.de/view.php?id=683 Init missing word position to 0 like in other non text body words reger 2016-09-14 00:36:19 +02:00
  • a4465c97d6 as requested, disable/remove old swf parser http://forum.yacy-websuche.de/viewtopic.php?f=8&t=5861#p33098 reger 2016-09-13 02:47:36 +02:00
  • 7f63fc50f3 prepare a IndexSegment test case for RWI index testing + prevent NPE in Segment.clear() on missing embedded solr instance. reger 2016-09-11 23:25:44 +02:00
  • 96467c5467 remove not needed counter in Tokenizer (completing last changes) including a small change, word posintext counting. We remember/store 1st posintext. Previously following words got a handle (posintext) excluding found. Now it just counts and assigns true posintext as handle (posintext) reger 2016-09-10 18:23:09 +02:00
  • ce536fe90b Fixed remaining OpenSearch link name consistency issues luccioman 2016-09-10 09:33:27 +02:00
  • 40442ca3c8 Add title attribute to Translator_p.html edit fields (to display complete text hint) Thanks @luc reger 2016-09-09 22:42:21 +02:00
  • dff7805d16 Updated french translation for some admin pages. luccioman 2016-09-09 09:00:16 +02:00
  • d66b0f7b7b Fixed french messages encoding in YaCy tray. luccioman 2016-09-09 07:43:33 +02:00
  • 44b4d3984e OpenSearch short name : made it generic (non internationalized) luccioman 2016-09-08 17:31:28 +02:00
  • 7efb66ee10 adjust the WordReference.join wordsintext calc to take the max (instead of sum) The reference is for the same url (add same for title and phrases). + del redundant join() procedure reger 2016-09-08 02:29:48 +02:00
  • 71ac0343bb Merge pull request #71 from luccioman/postprocessing luccioman 2016-09-07 10:07:03 +02:00
  • 0a9ff14d96 Fixed NullPointerException case and added Javadoc luccioman 2016-09-07 10:03:48 +02:00
  • 06d4f93d03 Merged master into postprocessing branch luccioman 2016-09-07 09:28:37 +02:00
  • b73d2db914 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git Michael Peter Christen 2016-09-07 06:49:15 +02:00
  • 25a3c7a6d0 catch exception and write end of object Michael Peter Christen 2016-09-07 06:48:52 +02:00
  • 272cdd496a reactivate sentence counter in WordTokenizer for phrasepos ranking, by counting punctuation (delivered as 1 char word) again. reger 2016-09-07 02:16:16 +02:00
  • 5e165a8150 removed unused imports Michael Peter Christen 2016-09-06 18:46:24 +02:00
  • c716648c78 enhanced json encoding of strings Michael Peter Christen 2016-09-06 18:45:29 +02:00
  • 6139bd85a8 fix for broken facet names Michael Peter Christen 2016-09-06 17:19:54 +02:00
  • 5060f9fee9 fix for too long snippets Michael Peter Christen 2016-09-06 09:05:39 +02:00
  • 8681cee3f3 fix for bad comma Michael Peter Christen 2016-09-06 09:00:35 +02:00
  • db6d8fc197 fix for bad json Michael Peter Christen 2016-09-06 07:44:38 +02:00