Commit Graph

  • 89124335c4 update bookmark autosearch description - add german translation reger 2015-05-10 02:29:08 +02:00
  • fbf85a1561 added temporary debug output in http client Michael Peter Christen 2015-05-08 15:31:01 +02:00
  • ff29b0e503 added option to re-index exported xml snapshot dumps to HTCACHE/snapshots by just placing them in the SURROGATES/in path Michael Peter Christen 2015-05-08 15:30:26 +02:00
  • 6f4fe4b175 revert of 8a7c68e4c7 keeping surrogates after processing is essential for some users. If the space they are taking is too high, please set up an automatic deletion process (like a cronjob). Michael Peter Christen 2015-05-08 14:01:30 +02:00
  • 213401a446 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-05-08 13:48:29 +02:00
  • 97930a6aad added must-not-match filter to snapshot generation. also: fixed some bugs Michael Peter Christen 2015-05-08 13:46:27 +02:00
  • 9d8f426890 adding a try-catch to link graph processing to prevent that a single malformed url interrupts the storage process Michael Peter Christen 2015-05-08 10:38:33 +02:00
  • b47267b79c precaution against NPE on createorgetBookmark on search result reger 2015-05-07 03:25:19 +02:00
  • 75879e051b Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-05-03 03:03:45 +02:00
  • 8a5b8f8789 on bookmaring of search result, remember orig. query in separate bookmark property (instead of using the description field) - adjust display and autosearch - don't overwrite existing bookmark but combine info reger 2015-05-03 02:31:50 +02:00
  • 7224209486 break out of NormalizeDistributor loop on timeout reger 2015-05-02 02:36:18 +02:00
  • cf1fc7f700 harmonize filesearch input box layout reger 2015-05-01 19:24:14 +02:00
  • 4d73e9de06 upd to metadata-extractor-2.8.1 reger 2015-04-30 00:01:11 +02:00
  • e334a06370 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-04-29 10:51:09 +02:00
  • 0904a041a6 upd to poi-3.11.jar reger 2015-04-29 01:53:04 +02:00
  • 47e61f8325 fix typo in image filter query (extra bracket) reger 2015-04-28 03:12:14 +02:00
  • 4b4ab6799f fix String out of range in Collection Nav see http://mantis.tokeek.de/view.php?id=573 reger 2015-04-27 22:38:40 +02:00
  • 572cfe8fd4 improve character encoding for urlproxy servlet for none utf-8 pages reger 2015-04-26 17:42:39 +02:00
  • b161473cd0 upd to jsoup-1.8.2 reger 2015-04-26 17:41:05 +02:00
  • 6bc8a9b11e make Quality of Service Servlet available to prioritize requests from local host This assigns priorities to incoming requests. Higher priority numbers are served before lower. (disabled by default in defaults/web.xml, uncomment or copy entry to DATA/Settings/web.xml) reger 2015-04-26 04:29:32 +02:00
  • af2d66e3d8 correct typo in de.lng reger 2015-04-25 22:38:38 +02:00
  • 71bf95af8a upd parser calls in test cases reger 2015-04-25 03:24:28 +02:00
  • 579303a04e add additional links to crawl queue pages reger 2015-04-25 02:45:05 +02:00
  • 99718dc09a don't record dump generation calls since that - is not a change of the index - happens very often within self-backup strategies from the outside (i.e. cronjobs) Michael Peter Christen 2015-04-23 18:17:28 +02:00
  • 5b59477415 update to bootstrap.css 3.3.4 Michael Peter Christen 2015-04-23 06:36:57 +02:00
  • 016b4e58ac Merge pull request #4 from dertuxmalwieder/master Michael Peter Christen 2015-04-21 10:13:57 +02:00
  • 82a8c282f7 Readme improvements Sven Knurr 2015-04-20 14:30:45 +02:00
  • 0d365e67a5 Merge pull request #2 from Scarfmonster/master Michael Peter Christen 2015-04-20 10:24:34 +02:00
  • 1ded4b4889 Merge pull request #3 from shaman/master Michael Peter Christen 2015-04-20 10:23:39 +02:00
  • 4f1cb7c65c fix typos Eugene Kuligin 2015-04-20 05:27:15 +03:00
  • 8ae3229306 add vertical margin to the search cloud block Eugene Kuligin 2015-04-20 05:24:50 +03:00
  • f9408dfa48 fix RSS icon displaying Eugene Kuligin 2015-04-20 05:19:37 +03:00
  • 62c95c6759 increase view space with normal font weight in the searched titles Eugene Kuligin 2015-04-20 05:13:05 +03:00
  • f7b0148f6a fix NPE in Vocabulary_p servlet called w/o parameter reger 2015-04-20 00:01:14 +02:00
  • ca1a70aec8 fix for Accept '?' URLs column in Crawl Profile List Ryszard Goń 2015-04-19 15:55:49 +02:00
  • b0cd0212fd SynonymLibrary status check fix for multiple files Ryszard Goń 2015-04-17 15:14:10 +02:00
  • f3f1b2e899 added English synonyms Ryszard Goń 2015-04-17 16:15:35 +02:00
  • 5408448a56 skip redundant add. of keywords to text search uses keywords as default search field reger 2015-04-17 02:14:13 +02:00
  • 296e97c78e put https port in peers dna as we flag if a peer is accesible via https, we need to know the port if we want to use is (e.g. for interYaCy communication) start to provide / tansport the port by recording it in peers dna. - add https link on the Network.html lock symbol reger 2015-04-16 02:36:12 +02:00
  • 088853c1e8 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git Michael Peter Christen 2015-04-15 13:17:37 +02:00
  • fed26f33a8 enhanced timezone managament for indexed data: to support the new time parser and search functions in YaCy a high precision detection of date and time on the day is necessary. That requires that the time zone of the document content and the time zone of the user, doing a search, is detected. The time zone of the search request is done automatically using the browsers time zone offset which is delivered to the search request automatically and invisible to the user. The time zone for the content of web pages cannot be detected automatically and must be an attribute of crawl starts. The advanced crawl start now provides an input field to set the time zone in minutes as an offset number. All parsers must get a time zone offset passed, so this required the change of the parser java api. A lot of other changes had been made which corrects the wrong handling of dates in YaCy which was to add a correction based on the time zone of the server. Now no correction is added and all dates in YaCy are UTC/GMT time zone, a normalized time zone for all peers. Michael Peter Christen 2015-04-15 13:17:23 +02:00
  • f6a55f9279 incoming connection count/text fix improvement on http://mantis.tokeek.de/view.php?id=570 reger 2015-04-15 02:16:53 +02:00
  • 3e338e0987 Merge pull request #1 from Scarfmonster/master Michael Peter Christen 2015-04-14 10:33:43 +02:00
  • 702c30e619 add info text icon next to Augmented Browsing check-box with hint to config page reger 2015-04-14 03:19:27 +02:00
  • 4c907bec89 show "Augmented Browsing" link in search result only if urlproxy allowed and option switched on in layout (AugmentedBrowsing_p.html, ConfigSearchPage_p.html) as user only gets a error page if the option is not enabled reger 2015-04-14 02:07:02 +02:00
  • 6d78a6d06e Search navigation fix Ryszard Goń 2015-04-13 23:32:06 +02:00
  • b060ba900d added parsing of contentprop attribute in html tags for content='startDate' and content='endDate'. The value of these field is now written to new solr fields startDates_dts and endDates_dts. Michael Peter Christen 2015-04-13 16:20:00 +02:00
  • a08a3c5f29 reverted json syntax for facet results to version from january Michael Peter Christen 2015-04-13 16:18:15 +02:00
  • 4cb4f67f38 added parsing of dd, dt and article html fields. The parsed result is written to special solr fields which are deactivated by default. Michael Peter Christen 2015-04-12 22:02:45 +02:00
  • 1395f10e95 fix typecast for css links reger 2015-04-12 01:11:47 +02:00
  • 3288489fd2 more logging during start-up Michael Peter Christen 2015-04-11 13:00:32 +02:00
  • abaaaef5f1 fix for filter queries Michael Peter Christen 2015-04-11 12:30:29 +02:00
  • d8cc773d05 fix for not valid json in case that topics are switched off Michael Peter Christen 2015-04-11 12:20:29 +02:00
  • 4d00175157 <experimental> added parsing of <article> html element. Whenever such an element occurs, the complete content of all article elements replaces the parsed <content> part of documents. Michael Peter Christen 2015-04-10 16:16:20 +02:00
  • 1df6492019 enhanced suggestions Michael Peter Christen 2015-04-10 15:59:18 +02:00
  • c7fdde3bd1 replaced "fork me" banner with github banner Michael Peter Christen 2015-04-10 15:10:18 +02:00
  • 876cdb083f Merge branch 'master' of github.com:yacy/yacy_search_server Michael Peter Christen 2015-04-09 14:31:16 +02:00
  • 699a81ae01 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2015-04-09 14:29:11 +02:00
  • ae02c92fd0 logging fix Michael Peter Christen 2015-04-09 14:21:23 +02:00
  • 642daad528 upd to httpcore 4.4.1 reger 2015-04-08 22:42:30 +02:00
  • 5651713134 better debugging of fq Michael Peter Christen 2015-04-07 17:02:02 +02:00
  • f5a032f293 split query into filter query and text query to get better ranking results and faster results Michael Peter Christen 2015-04-07 16:10:13 +02:00
  • 36e9cdb376 testing switching off cold searchers; maybe this brings performance enhancements when using large facets Michael Peter Christen 2015-04-07 13:14:41 +02:00
  • 2e88028c1a when selecting collections in navigation, do show the un-selected collections in search result. When selecting one of them in another search, switch off the previously selected collection. This actually turns the collection navigation modifier into a radio-button like behaviour Michael Peter Christen 2015-04-07 13:13:58 +02:00
  • 1de9b21c65 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2015-04-07 12:40:43 +02:00
  • 5f4cd8d6f5 replace deprecated getIP with getIPs in AbstractRemoteHandler reger 2015-04-07 00:10:42 +02:00
  • 01759e9af9 upd to PDFBox 1.8.9 reger 2015-04-05 23:38:14 +02:00
  • c5398c3c88 include htroot (*.class) in maven clean harmonize antrun javac call in pom with build.xml reger 2015-04-04 21:11:09 +02:00
  • 2f592a8063 add SynonymLibrary status to DictionaryLoader_p servlet http://mantis.tokeek.de/view.php?id=564 reger 2015-04-04 00:24:16 +02:00
  • fa7edc9f7a refactoring of filter queries (several queries instead only one) Michael Peter Christen 2015-04-02 13:27:47 +02:00
  • c59ebde083 show location nav as selectable nav in search page layout - switch automatically on upon load of geodata provider - but allow switch on also without geodata file (and display the location nav if search result has lat/lon location) reger 2015-04-02 02:10:00 +02:00
  • 5bc1e5cfbf use a cursor hand on facet headline to show that this is clickable Michael Peter Christen 2015-04-01 18:37:45 +02:00
  • 40389987ec Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2015-04-01 18:18:05 +02:00
  • f9ba50379d added an expansion option to search facets on result page: - if less or equal of 8 facet options are present, they are shown by default - if more facet options are present, they are hidden To view or hide all facets, just click on the facet header bar Michael Peter Christen 2015-04-01 18:17:52 +02:00
  • 1f0f77bb77 make location facet return results for location nav facet of field coordinate_p does not return results, now using coordinate_p_0_coordinate as alternative to get facet counts. As the actual facet value is not used this should not harm any analysis (even if facet is a incomplete location). If facet value is used in future likely *_geohash field could be introduced (for facet and other ... as transport value) reger 2015-04-01 01:57:56 +02:00
  • b1ec0644e5 fix NPE in location search on missing/empty PubDate in underlaying rss data reger 2015-03-31 02:20:13 +02:00
  • c1dcc8c456 fix display and limit of max server connections after startup (on restart value returned to default=50) This has no effect on Jetty but the limit is still respected. reger 2015-03-29 07:12:23 +02:00
  • 2f84b04fa9 add err msg on failure during Load_rss reger 2015-03-29 05:48:54 +02:00
  • 96292cf3eb shorten exception loggin on not available connection in Load_RSS_p servlet reger 2015-03-28 21:12:00 +01:00
  • 839b962c20 correct percent encoding for '%' char reger 2015-03-28 03:05:21 +01:00
  • 66d0b5046a fix NPE on viewfile of url not in index reger 2015-03-26 00:21:31 +01:00
  • 5789c96292 fix: banner did not show link and qph for portal mode Michael Peter Christen 2015-03-25 13:21:36 +01:00
  • 9bf0d7ecb9 added a new collection type 'dht' to all documents from the peer-to-peer interface to distinguish rich and poor document data. This also reverts some changes from commit 796770e070 because the firstSeen database is the wrong method to distinguish these types of data Michael Peter Christen 2015-03-24 12:32:39 +01:00
  • 7fcf0d0b71 fix missing display of CrawlerMonitor -> robots.txt Monitor revert delete of file api/table_p.html see 3ffe19b85c (still used in this menu) reger 2015-03-24 00:13:05 +01:00
  • efadb710a4 Updated Git links from Gitorious to Github. Marc Nause 2015-03-23 11:12:39 +01:00
  • 796770e070 prevent overwrite of crawled or received full documents by (newer) metadata To protect rich index data (full resource) from overwriting by metadata gathered during remote search, the newly introduced "firstSeen" index is used to differentiate between full-resource-doc and metadata, as a "firstSeen" entry is only added on store's of full-resource-docs (during crawl or remote search). reger 2015-03-23 03:57:47 +01:00
  • 7cf28c4f94 upd to Jetty 9.2.10 reger 2015-03-22 02:47:12 +01:00
  • ee2490ab98 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Michael Peter Christen 2015-03-19 10:42:57 +01:00
  • 431311df42 fix get fresh_date_dt to allow returned value to be date in future reger 2015-03-18 22:04:03 +01:00
  • 74c7e8b686 Fixes hanging FlushThread (see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5447) by replacing put() method by the more robust add() to add a merge job to the queue. otter 2015-03-18 21:57:41 +01:00
  • f63fff9008 fix snippet containig number with comma as desmo point http://mantis.tokeek.de/view.php?id=344 to keep it as one word (by altering the split regex) - added sniipet test case with number - regex for word split to match multiple splitcars reger 2015-03-16 02:03:40 +01:00
  • b241264632 fix error on *abc query input http://mantis.tokeek.de/view.php?id=486 reger 2015-03-15 22:31:47 +01:00
  • 2ef8ffdb60 apply UTF-8 encoding copied from escape() reger 2015-03-15 06:02:45 +01:00
  • 7120ea42f1 fix for path with char code > 255 (causing index out of bound exception) + test cas for it reger 2015-03-15 03:37:32 +01:00
  • 1d81bd0687 fix url encoding for path see http://mantis.tokeek.de/view.php?id=559 So far we used same escape procedure for all parts of the url (which includes x-www-form-urlencoded for all url components) Added capability to use different encoding rules for the different url components (through specific bitset for each component). (this is inspired by org.apache.http.client and java.net.uri implementation). - Added test case for http://mantis.tokeek.de/view.php?id=559 reger 2015-03-15 00:46:07 +01:00
  • 62087fb8b2 fix MultiProtocolURL mailto protocol detection reger 2015-03-13 02:02:53 +01:00
  • 65f8371163 fix link to DeReWo project page reger 2015-03-11 21:28:57 +01:00
  • 2e8c24e02a fix link to DeReWo download file reger 2015-03-11 20:02:23 +01:00
  • 706f75ddc2 try to fix hang on index blob merge on shutdown http://mantis.tokeek.de/view.php?id=505 It happens but not able to reproduce. This change makes sure terminate signal is catched at end of currently running merge jobs reger 2015-03-11 19:36:23 +01:00
  • f94e34058c fix url (path) %-decoding http://mantis.tokeek.de/view.php?id=519 - add test case for this reger 2015-03-11 01:05:14 +01:00