1
0
mirror of https://github.com/yacy/yacy_search_server.git synced 2025-06-24 04:36:08 -04:00
Commit Graph

8950 Commits

Author SHA1 Message Date
fcc9386ed3 enhanced the (already fast!) png exporter 2020-12-03 12:18:07 +01:00
4e9b425f98 missing fix for latest commit 2020-12-03 00:40:51 +01:00
3213d9db37 updated jetty from 9.4.17 to 9.4.35
and fixed a bug in ServerSideIncludes that appeared only in that recent
version of jetty
2020-12-03 00:21:15 +01:00
787fec0658 reduced complexity - removed concurrency in sort 2020-12-02 18:39:45 +01:00
cef5fde343 adding message to UI to make port change transparent 2020-12-02 18:05:38 +01:00
52228cb6be added a gc to cleanup process (once every 10 minutes) 2020-12-02 00:13:00 +01:00
22841ffbf1 creating a threaddump during every cleanup process
to be able to find out what a peer did (not) last time before a crash
2020-12-01 03:00:24 +01:00
36e616271b do better documentation on how to set a default password 2020-12-01 02:18:08 +01:00
df2bf9ef28 try to fix maven build error 2020-11-29 14:24:33 +01:00
264bab6700 trying to fight the UI unavaiability
this path addresses a possible issue with too many open connections to
remote peers
2020-11-29 14:15:34 +01:00
7947baeb49 removed all remaining deprecation warnings 2020-11-23 00:03:18 +01:00
c0f6d6e11d removed one deprecation warning for jetty library initializing ssl
server port
2020-11-22 23:27:58 +01:00
133440a7a6 some debug lines 2020-11-22 23:12:04 +01:00
3431f91db9 removes unused 'unused' tokens 2020-08-04 20:09:34 +02:00
fc03c4b4fe removes some warning and unused objects 2020-08-03 20:44:31 +02:00
4a495df63a removes some deprecation-warnings 2020-07-31 17:28:06 +02:00
dd9d4b1188 replace org.junit.Assert.assertThat by
org.hamcrest.MatcherAssert.assertThat from hamcrest 2.2 to avoid
deprecation-warning
2020-07-28 19:09:26 +02:00
df9ea0a42a removes some warnings: unused imports, params 2020-07-27 22:20:49 +02:00
9bc2297161 fixes deleting during recrawl 2020-07-22 22:15:00 +02:00
80785b785e adds deleting during recrawl 2020-07-09 19:32:16 +02:00
e0ad8ca9da replaced json library from JSON.org with libandroid-json-java
This fixes https://github.com/yacy/yacy_search_server/issues/347
2020-04-24 11:45:25 +02:00
ea8df27e95 modified org.json.* library to fit into the YaCy environment
as drop-in replacement.
Also made some fixes and enhancements to the library.
2020-04-24 11:42:06 +02:00
60dc1241a3 added org.json.* library
from https://android.googlesource.com/platform/libcore/+/refs/heads/master/json/src/main/java/org/json
as a preparation step for
https://github.com/yacy/yacy_search_server/issues/347
2020-04-24 10:28:43 +02:00
053e54a2c7 grand CORS for json files 2019-11-05 11:50:56 +01:00
cfa27d2fd5 fixed links 2019-10-20 20:20:50 +02:00
cb20aa7e54 removed donation message in search result column 2019-10-17 01:35:44 +02:00
25227676ae removed some warnings 2019-09-28 02:07:08 +02:00
6b45cd5799 New optional crawl filter on the URL a doc must match to crawl its links
For finer control over which parsed documents can trigger an addition of
their links to the crawl stack, complementary to the existing crawl
depth parameter.
2019-05-01 08:54:19 +02:00
d16bc99835 Added "Show Metadata" links to the ViewFile.html links mode
To conveniently follow parsed links in the file viewer
2019-04-18 15:31:38 +02:00
a5771b1f14 Made SNI extension user configurable without the need for server restart
TLS Server Name Indication (SNI) extension activation can now be
configured with the new Settings_p.html?page=httpClient administration
page.
SNI extension is also now enabled by default, as in 2019 the
unrecognized_name(112) alert is more properly handled by major web
servers TLS implementations, following the RFC 6066 standard.

Related YaCy issues :   and 
JDK 1.7 bug :
https://bugs.java.com/bugdatabase/view_bug.do?bug_id=7127374
Apache httpd issue :
https://bz.apache.org/bugzilla/show_bug.cgi?id=56241
RFC 6066 : https://tools.ietf.org/html/rfc6066#section-3
2019-04-14 15:41:13 +02:00
e90405b6f0 Support parsing audio URLs without file extension
Added also a Junit for the audio tag parser
2019-04-09 11:40:21 +02:00
a8316c79da Allow JS resorting of search results by unauthenticated users
Acces rate limitations to this search mode by unauthenticated users are
set low by default to prevent unwanted server overload but can be
customized through the SearchAccessRate_p.html configuration page

Fixes 
2019-04-03 14:21:53 +02:00
0ab2b49c31 Made /yacysearch access rate limitations user configurable
With a new admin page at /SearchAccessRate_p.html in menu Network Access
> Local Search > Access Rate Limitations
2019-04-02 17:42:50 +02:00
5b7e41202a Added Solr GSA writer support for responses from remote instances 2019-03-27 18:23:41 +01:00
4d8a948455 Properly close PDF snapshots loaded with pdfbox library 2019-03-22 09:50:30 +01:00
74e6d6e984 Added Solr GrepHTML writer support for responses from remote instances 2019-03-20 18:24:16 +01:00
5e6501974d Added Solr snapshots writer support for responses from remote instances 2019-03-19 11:25:44 +01:00
384c37102c Improve accuracy of total results count on latest pages in Stealth mode
Previously, when mixing results from local RWI and local Solr (Stealth
mode), total local Solr count could be ignored on last result pages,
when the page offset was higher than local Solr count but lower than
total RWI count.
2019-03-04 10:05:47 +01:00
5e9a08355a Improved logging for federated search
- Do not use spaces in logger identifier name so the log level can be
configured in yacy.logging
- Hold the logger instance to avoid the logging system to look for it
from its name at each appended log message
2019-02-02 09:59:24 +01:00
9782a98a9c Added the possibility to customize facets sort type and direction
Previously search navigators/facets elements were sorted only by counts.
Now from the ConfigSearchPage_p.html admin page, sort direction
(ascending/descending) and type (on counts or labels) can be customized
independently for each navigator.
2019-01-24 18:43:06 +01:00
c2398fd890 remove warnings: 'Statement unnecessarily nested within else clause' 2019-01-10 20:02:57 +01:00
811d40a6c4 taking care of closing inputstreams, HTTPClient 2019-01-04 18:58:49 +01:00
8d2e7262d9 Recrawl:
- set the chunksize to 100 to meet the max of the embedded solr
- re-enable sorting (the case where we switched it of should be away)
- enable recrawling on remote-solr
2019-01-04 18:46:59 +01:00
8f58c1dcfa extend the SolrServlet to be usable as remote solr (incl. update)
this feature needs to be enabled by uncomment the url-pattern
2019-01-04 18:27:44 +01:00
7223a2fdb1 Removed usage of now deprecated Jetty function 2018-12-22 14:42:22 +01:00
440d9f2fa0 Exclude peers with empty or disabled RWI from remote RWI search 2018-12-20 14:53:01 +01:00
08ea0b0397 Added a configurable timeout to wkhtmltopdf calls for pdf snapshots
Necessary to prevent blocking the indexing workflow when some
wkhtmltopdf renderings fail without terminating
2018-12-11 22:31:31 +01:00
3fb449b3b6 Properly resolve relative URLs against document URL in html base tags
Fixes issue 
2018-12-06 20:18:00 +01:00
73a6e45524 Extended detection of external tools used for Snapshots generation
This enable detecting wkhtmltopdf and Imagemagick convert executables
when they are at system Path in addition to common installation paths.
2018-12-06 09:53:08 +01:00
7dc1f60619 Fixed detection of absolute data folder path on MS Windows 2018-11-18 10:08:20 +01:00