6b13101d18Collage.html: Fixed problem where the German translation broke the action that contained Collage in a form build.xml: Fixed check for existing private.key, added check for non existing release in target sign and changed the include filenames for changed libs Added log4j.properties file to eliminate the warning about a not initialized log4j subsystem with parameters for one console appender
mikeworks
2010-07-26 20:18:19 +00:00
63c5634b0fadded online documentation for ranking configuration
orbiter
2010-07-26 10:08:21 +00:00
aa663cda4dConfigUpdate_p.html and ConfigUpdate_p.java: Added check for downloaded releases and disabled buttons in case no new releases available de.lng: Updated German translation for additional String in ConfigUpdate_p.html XHTML 1.0 Strict fixes for all the other .html files yacy/ui/css/yacyui-portalsearch.css: added .hidden class that was removed from ConfigProperties_p.html Switchboard.java: Added URL for thread Remote Crawl Job and set URL for Remote Crawl URL Loader to null to fix empty href=""
mikeworks
2010-07-25 13:19:16 +00:00
afd1cd7979*) HTML fixes
low012
2010-07-24 15:42:09 +00:00
0f248e7433ConfigBasic.html: XHTML 1.0 Strict fixes DictionaryLoader_p.html: Filled <dt> elements to eliminate warnings Moved CSS for portalsearch field from header to metas template because it belongs in the <head>er yacui-portalsearch.css Added #yacylivesearch form { display: inline; } because HTML 1.0 Strict does not allow <form><input> and the added <p> would otherwise provoke a line break de.lng: Updates translations for added <dt> elements and deactivated statement in DictionaryLoader_p.html
mikeworks
2010-07-24 15:26:53 +00:00
b12db14b9fAdded Generics to new net.yacy.upnp.* classes to eliminate compiler warnings Added @Deprecated for deprecated functions getIPDevices and getPPPDevices in class InternetGatewayDevice Changed debug statement in Domains.java and corrected filename in comments header
mikeworks
2010-07-24 13:48:45 +00:00
0b89fa2c8d*) HTML fixes
low012
2010-07-24 12:28:48 +00:00
ad96a14d0a*) jump to Crawl Profile editor if a profile is selected to be edited
low012
2010-07-23 17:35:09 +00:00
e4ced6484byacyinteractive.html: - added type="text/javascript" to script resource - removed unintentional "\" from <a> link - changed "name" tag in <form> element to "id" for XHTML 1.0 Strictness (remaining warnings come from script elements writing end tags like </tr> that might confuse some validators)
mikeworks
2010-07-23 06:36:06 +00:00
b7102eff92... migrating to HttpComponents-Client-4.x ...
sixcooler
2010-07-22 23:08:37 +00:00
572e429eff- fixes UPnP not working discussion on forum: http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2881 SVN 6987 fixed net.yacy.upnp.devices.UPNPRootDevice for usage with JxPath > 1.3 by using a default namespace (xmlns="urn:schemas-upnp-org:device-1-0") This commit now fixes the same problem for net.yacy.upnp.devices.UPNPService with default namespace (xmlns="urn:schemas-upnp-org:service-1-0")
mikeworks
2010-07-22 19:13:37 +00:00
52718e6dcb... migrating to HttpComponents-Client-4.x ... monitoring: replaced unused 'idletime' by uploading bytes added some kind of 'upload-throttling' at dht-out :-)
sixcooler
2010-07-22 00:51:41 +00:00
b143f6b169ConfigHeuristics_p.html: XHTML 1.0 Strict Changes - added empty action tag to form - replaced name tags with id (name is not a valid tag in XHTML 1.0 Strict) - changed label for target (so now clicking on the labels also activates the checkboxes) de.lng: Test with Subversion properties #2
mikeworks
2010-07-21 22:40:34 +00:00
22dbbcfa56better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
orbiter
2010-07-18 20:14:20 +00:00
2d2771a12e*) more HTML fixes
low012
2010-07-18 19:21:59 +00:00
eb8550526d*) fixed small HTML bug
low012
2010-07-18 18:40:41 +00:00
8674a65488removed override directive which caused a compile error in eclipse helios
orbiter
2010-07-18 18:37:20 +00:00
b4d5bb6a3eSteering.html: Changed link from Settings_p.html to ConfigAccounts_p.html for setting not existing Administrator password de.lng: Added missing translations for Steering.html during restart/update
mikeworks
2010-07-18 13:31:44 +00:00
d851758dc6Added German translation for ConfigHeuristics_p.html to de.lng Fixed Network -> Heuristics title tag of the page
mikeworks
2010-07-05 22:58:51 +00:00
43e6ce62afuse heuristics only if user is authenticated
orbiter
2010-07-05 21:52:02 +00:00
dcfb5b942dUpdated German translation for Network.html in de.lng
mikeworks
2010-07-05 21:26:37 +00:00
aa66da5135corrected hint for debian installation update
orbiter
2010-06-30 14:31:16 +00:00
7188c54ddbpatch to get dht access to developer peers
orbiter
2010-06-30 08:42:29 +00:00
25024d6ab2fix for problen when accessing the metadata index. The index was not available for all peers with no RAM table copy.
orbiter
2010-06-30 07:22:50 +00:00
8e88fa4a62*) fixed indetion (tab vs. spaces) *) added Android packages MIME type
low012
2010-06-29 21:31:22 +00:00
b6fb239e74redesign of parser interface: some file types are containers for several files. These containers had been parsed in such a way that the set of resulting parsed content was merged into one single document before parsing. Using this parser infrastructure it is not possible to parse document containers that contain individual files. An example is a rss file where the rss messages can be treated as individual documents with their own url reference. Another example is a surrogate file which was treated with a special operation outside of the parser infrastructure. This commit introduces a redesigned parser interface and a new abstract parser implementation. The new parser interface has now only one entry point and returns always a set of parsed documents. In case of single documents the parser method returns a set of one documents. To be compliant with the new interface, the zip and tar parser had been also completely redesigned. All parsers are now much more simple and cleaner in its structure. The switchboard operations had been extended to operate with sets of parsed files, not single parsed files. additionally, parsing of jar manifest files had been added.
orbiter
2010-06-29 19:20:45 +00:00
59c894029bremoved confusing double set button in ConfigHeuristics
orbiter
2010-06-28 22:27:20 +00:00
d4851441b0*) Added Android packages to parser in order to be able to create a decentralized search for direct downloads of Android apps.
low012
2010-06-28 20:41:08 +00:00
150cf42a1bmigrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2 see http://www.gnu.org/licenses/license-list.html for explanation Since (as far as I know) nobody else has ever contributed to these files I may be allowed to just apply an older license. You may consider this as a dual-licensing and may use and optionally replicate the older files under GPL 3.
orbiter
2010-06-28 16:25:14 +00:00
11b7853940added a configuration page for search heuristics. currently you can switch on there: - a site-operation heuristic that loads all direct links from a portal page if the site-operator is used - a direct crawl for search results from scroogle for the given search terms The configuration page can be found directly beside the network configuration page
orbiter
2010-06-27 21:38:16 +00:00
5d00888c95- added animated visualization for DHT-in and DHT-out in network graphic - found and fixed a possible memory leak in YaCy internal RSS feed system - some refactoring in RSS feed mechanisms to make this possible
orbiter
2010-06-27 10:45:20 +00:00
bf25407fddadded peer hash to internal RSSFeed. The hash will be used to display news activities in the network graphic.
orbiter
2010-06-26 23:10:57 +00:00
1557e0f2d0- some refactoring for internal RSSFeed (protocol of all actions as seen on status page) - added dht-out to internal RSSFeed (you can see now messages about distributed indexes on status page)
orbiter
2010-06-26 22:39:27 +00:00
5a4684f21fallow words with length >= 2 (you can't search for 'wm' with 3-letter words...) lets try that. If we run into a memory problem because of too many 2-letter-words, then we must introduce whitelists for 2-letter words.
orbiter
2010-06-26 16:31:26 +00:00
b5e190099d- updated pdfbox and fontbox to 1.1.0 - added license file to sbbi-upnplib
orbiter
2010-06-26 10:58:07 +00:00
37b8827a7a- removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection. - cleaned uo the code. The new eclipse helios provided new warnings for dead code. This change cleans up most of these warnings
orbiter
2010-06-26 10:32:47 +00:00
dcd01698b4added a 'transition feature' that shall lower the barrier to move from g**gle to yacy (yes!):
orbiter
2010-06-25 16:44:57 +00:00
d5d48b8dc7enhanced network animation (smooth loading, reload not all 4 animation phases at once)
orbiter
2010-06-24 15:01:26 +00:00
103c848af8enhancements in image drawing speed
orbiter
2010-06-24 13:20:45 +00:00
3a9dc52ac2added a fascinating new way to search _and_ start a web crawl at the same time: implemented a hint from dulcedo "use site: - operator as crawl start point". YaCy already was able to search using a site-constraint. This function is now extended with a instant crawling feature. When you now use the site-operator, then the landing page of the site iand every page that is linked from this page are loaded, indexed and selected for the search result within that search request. When the remote server responds quickly enough, then this process can result in search results during the normal search result preparation .. just in some seconds.
orbiter
2010-06-23 11:19:32 +00:00
8e3cbbb6a9more animation: update of network image every 10 seconds
orbiter
2010-06-23 10:29:04 +00:00
d7767e7589IFFRESH is too strong, IFEXIST sufficient for cache policy when doing a link verification (this is as it was two commits before)
orbiter
2010-06-22 19:16:26 +00:00
777195e8d1more abstraction for access of LoaderDispatcher and cache
orbiter
2010-06-22 12:28:53 +00:00
7bcfa033c9more abstraction of the htcache when using the LoaderDispatcher: a cache access shall not made directly to the cache any more, all loading attempts shall use the LoaderDispatcher. To control the usage of the cache, a enum instance from CrawlProfile.CacheStrategy shall be used. Some direct loading methods without the usage of a cache strategy have been removed. This affects also the verify-option of the yacysearch servlet. If there is a 'verify=false' now after this commit this does not necessarily mean that no snippets are generated. Instead, all snippets that can be retrieved using the cache only are presented. This still means that the search hit was not verified because the snippet was generated using the cache. If a cache-based generation of snippets is not possible, then the verify=false causes that the link is not rejected.
orbiter
2010-06-21 14:54:54 +00:00
fd9f0714a3added link verification, global search and navigation to opensearch description. Hint: the YaCy search can easily be integrated into the firefox search window: Just start a search, then open the pop-up menu inside the firefox search input window and select "add search engine"
orbiter
2010-06-20 11:04:11 +00:00
7e2d6fac12patch for bad values during local search join
orbiter
2010-06-20 00:31:00 +00:00
2ddb952a5cadded the (fixed and anhanced) secondary search process. The process was disabled since some time. The search process for more than one word should be enhanced now and produce much more results.
orbiter
2010-06-20 00:11:12 +00:00
58035ef784fix in snippet loading
orbiter
2010-06-18 19:36:11 +00:00
986d4f34d9added a consistency check for new queues
orbiter
2010-06-18 18:59:42 +00:00
73f03e05eefixed a bug in snippet fetch strategy: cache only does not help if resource can only be found in web
orbiter
2010-06-18 15:25:25 +00:00
fbf021bb50redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928
orbiter
2010-06-18 09:44:21 +00:00
090eae2cf5fix for broken index abstract generation
orbiter
2010-06-18 09:25:44 +00:00
e78fd21fcaAdded German translation in de.lng for new DictionaryLoader_p.html Geolocalization component loader page
mikeworks
2010-06-18 06:36:18 +00:00
87087f12fe- scanned remote search process and enhanced some data structure and synchronizations here and there - removed concurrency overhead for small number of index normalizations as it happens during remote search - removed 'load only parseable' constraint for snippet fetch because some resources may not have any url file extension and these had therefore not been parseable and searcheable since they may become parseable after loading when their mime type is known - this partly fixes some problems with http://forum.yacy-websuche.de/viewtopic.php?p=20300#p20300 but more changes are necessary to get all expected search results
orbiter
2010-06-17 11:59:40 +00:00
7ddb70e7c6new license for ai.greedy component: LGPL (nobody else than me modified that code)
orbiter
2010-06-16 22:16:03 +00:00
b62fb38344fix for case where no release provider responds during auto-update (caused NPE)
orbiter
2010-06-16 18:43:45 +00:00
989819a28c- reduced peer-ping time-out from 30 to 10 seconds - no re-try for the peer ping any more (it's a test, let's see what happens)
orbiter
2010-06-16 08:30:13 +00:00
51332b787dreverted SVN 6869 as discussed with dulcedo in car after LinuxTag: missing time-out may be cause of locks during DHT-out
orbiter
2010-06-15 20:30:53 +00:00
4eab6473d3option to set more than 9999 MB RAM in input field :-)
orbiter
2010-06-15 20:26:15 +00:00
b03caaa57abetter handling of OOM situations
orbiter
2010-06-15 19:44:05 +00:00
56ff9d5fd4- extended news size from 512 to 1024 characters - a new news db will be created (news1024.db), the old one (news.db) can be deleted - peers with too large news payload are not ignored any more (they may have been invisible because they had a too large news payload!)
orbiter
2010-06-15 10:43:47 +00:00
353a924760- changed default memory to 500m - now xms is lower than xmx (lets try what happens) - removed default path for intranet crawl starts to avoid confusion as seen on linuxtag - added time-out to upnp request (i have a new router which may need that)
orbiter
2010-06-14 21:36:40 +00:00
0f3a3e34e1Updated German translation de.lng and fixed typos in html files (english)
mikeworks
2010-06-11 13:51:49 +00:00
5251a18e65de.lng: Added new Network.html German translations Network.html: shortened some <br /> tags to <br/> ConfigBasic.html fixed some typo cann for German translation file
mikeworks
2010-06-11 00:17:23 +00:00
a33f39832e- small change in display of use cases - explain usage of ftp, smb and file search domains
orbiter
2010-06-06 23:26:04 +00:00
60e71876ad- more abstraction (HashMap -> Map) - more concurrency-awareness (HashMap -> ConcurrentHashMap)
orbiter
2010-06-01 13:02:11 +00:00
a83772c71bfixes and enhancements for balancer: - crawl lists for each domain now uses a HandleSet which should use less memory than LinkedLists - but: fill more entries into the domain lists (all available entries) - fixes to selection criteria (best domain selection)
orbiter
2010-06-01 09:30:23 +00:00
9cde05418ffixed url crawl list display
orbiter
2010-05-31 00:27:00 +00:00
2eea806005less errors in image parser
orbiter
2010-05-30 11:18:05 +00:00
30b337fa9ffixes to balancer when crawling filesystem (problem was: host == null)
orbiter
2010-05-30 11:17:38 +00:00
844853243afixed balancer time guessing
orbiter
2010-05-30 10:28:42 +00:00
2e679b1302Small Fixes - no functional Changes
suessthomas
2010-05-27 21:01:22 +00:00
3f93a0cc8fredesign of remote proxy settings
orbiter
2010-05-26 00:01:16 +00:00
11639aef35- added new protocol loader for 'file'-type URLs - it is now possible to crawl the local file system with an intranet peer - redesign of URL handling - refactoring: created LGPLed package cora: 'content retrieval api' which may be used externally by other applications without yacy core elements because it has no dependencies to other parts of yacy
orbiter
2010-05-25 12:54:57 +00:00