Commit Graph

  • 4e2c14efbb fixed bugs in parser and ftp client orbiter 2010-12-02 11:05:04 +00:00
  • d78e322e84 added a directory-structure reader to ftp client orbiter 2010-12-02 08:08:01 +00:00
  • f0651e5f2f added image search to yacyinteractive.html: the search result view now switches from list format to image preview format when a search is restricted to png, gif or jpg documents orbiter 2010-12-01 18:48:21 +00:00
  • fffb91447a fixed crawl queue delete function orbiter 2010-12-01 14:55:40 +00:00
  • 4e771e2063 enhanced interactive search: - better table design - less enumeration of same table structure (prepared now for streaming) - added a 'remove filetype' link orbiter 2010-12-01 14:43:07 +00:00
  • b769cce433 - added a catch-all parser for all documents that cannot be parsed: they will contribute only their document url to the search index - enhanced the pdf and torrent parser: better document titles - enhanced the ftp client: longer time-out - fixed bugs in json for search results - enhanced yacyinteractive.html: added a file type navigator and a download-script generator for search result files orbiter 2010-11-30 16:13:55 +00:00
  • 6692e862ae do not reset language on config change lotus 2010-11-29 20:09:53 +00:00
  • 21e84539e8 one more fix to Domains orbiter 2010-11-29 19:49:57 +00:00
  • e192d61972 fix for latest commit orbiter 2010-11-29 19:32:47 +00:00
  • 22453b13ad implemented local host address discovery as posted in http://forum.yacy-websuche.de/viewtopic.php?p=21310#p21310 orbiter 2010-11-29 19:18:44 +00:00
  • cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they also deliver their search results as an rss feed - renamed YaCy's search result modifier keywords RECENT, NEAR and language: to the blekko slashtag naming scheme. YaCy now supports the following blekko-like built-in slashtags: /date - for search results ordered by date (most recent up) /near - for search results where search words appear near to each other (closest up) /language/<lang> - for a sorting by language where the wanted language comes first. Example: /language/de orbiter 2010-11-29 18:08:20 +00:00
  • a9f754c45f removed unused CR accumulation and distribution process; this was never used or extended in the last years. The resulting YBR ranking criterion is still a good idea and will be used in the future. Possible generation methods for YBR ranking are: - "trust-rank" using the link structure as can be discovered in a single crawl (idea from FSCONS) - "block-rank" calculated from the local link structure - a distributed "block-rank" using the xml API to the link structure from other peers orbiter 2010-11-29 11:07:42 +00:00
  • 3d945bb442 fix for ftp client: suppress bad directory listing time-out orbiter 2010-11-29 08:41:29 +00:00
  • d4a1a1850b removed warnings orbiter 2010-11-29 07:52:10 +00:00
  • 3b5830b7d4 *) Fixed typo. low012 2010-11-28 03:05:22 +00:00
  • 9b3fae9496 *) cleaning up the code a little bit *) program to interface, not implementation low012 2010-11-28 02:57:31 +00:00
  • 7bb4b001ed - view image files from cache - fixed generic header settings; affects CORS functionality orbiter 2010-11-27 09:16:16 +00:00
  • e7552bd719 *) cleaning up the code a little bit low012 2010-11-27 00:54:59 +00:00
  • a9741cc876 *) HTML fixes low012 2010-11-26 22:38:08 +00:00
  • 01ddb6d2ef *) HTML fixes low012 2010-11-26 22:19:31 +00:00
  • 321eb012fe removed two warnings and reverted one change orbiter 2010-11-26 11:15:42 +00:00
  • 737aaf6952 various small changes to ymarks apfelmaennchen 2010-11-25 21:16:47 +00:00
  • 8a50670546 some code clean up for the last post apfelmaennchen 2010-11-24 23:40:55 +00:00
  • 442497868d another step towards an auto tagging function for YMarks apfelmaennchen 2010-11-24 23:26:29 +00:00
  • dad5818b40 *) cleaning up the code a little bit low012 2010-11-24 01:31:41 +00:00
  • 9057e4d58c *) hopefully fixed bug described in http://www.yacy-forum.org/viewtopic.php?f=12&t=385 low012 2010-11-23 23:10:45 +00:00
  • 741a87a3e9 * make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers) f1ori 2010-11-22 19:12:51 +00:00
  • fd74bc388c * fix small bug in sessionid-removal * add testcase for sessionid-removal f1ori 2010-11-21 23:55:40 +00:00
  • dca9e16f51 * don't index redirecting pages twice * therefore auto-redirection of HTTPClient for crawling is disabled and the old code is reactivated f1ori 2010-11-21 22:46:12 +00:00
  • eb79b952ef *) cleaner code low012 2010-11-21 03:39:53 +00:00
  • 38fdf43587 *) renamed classes according to standard Java coding conventions *) String.isEmpty() was introduced in Java 1.6, but we still use Java 1.5 low012 2010-11-21 01:29:32 +00:00
  • 8281d12305 *) Ooops! low012 2010-11-21 00:41:45 +00:00
  • 025e3f4790 *) renamed classes according to standard Java coding conventions *) removed unused code low012 2010-11-21 00:39:21 +00:00
  • 3b9aa0504e *) removed unused code low012 2010-11-21 00:28:32 +00:00
  • db3db0fdb9 *) trying to make this class less confusing (probably failing) low012 2010-11-21 00:13:08 +00:00
  • 54e63b556e intermediate step for a YMark auto-tagging function based on word frequencies. apfelmaennchen 2010-11-17 15:17:29 +00:00
  • 403ee9c014 added a drill-down for metadata and word count to /api/ymarks/test_treeview.html apfelmaennchen 2010-11-16 00:48:38 +00:00
  • a025b1da89 * fix bug when browsing local filesystem (e. g. repository) with yacy f1ori 2010-11-15 14:47:16 +00:00
  • 28a290336d de.lng: Started translation of YMark feature here http://localhost:8080/Table_YMark_p.html Table_YMark_p.html: removed some labels because the columns are not filled yet, added ids for other label references and added full enabled="enabled" tags because SHORTTAG is not specified in XHTML 1.0 Strict mikeworks 2010-11-15 01:48:54 +00:00
  • 25426c6548 change language immediately lotus 2010-11-13 14:27:30 +00:00
  • 11ae5b108e enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index) apfelmaennchen 2010-11-13 13:02:56 +00:00
  • f147a022f8 enabled YMark Import for /Table_YMark_p.html apfelmaennchen 2010-11-13 10:32:37 +00:00
  • 2726606fc8 slightly enhanced interface for /Table_YMark_p.html apfelmaennchen 2010-11-12 21:13:27 +00:00
  • b87bf88ac8 using less memory on merging and rewriting blobs sixcooler 2010-11-12 16:02:20 +00:00
  • 94a9be18a4 added a ymark table administration: /Table_YMark_p.html apfelmaennchen 2010-11-10 22:53:27 +00:00
  • 25339f93c7 more updates to ymarks - working xbel import/export - exported xbel includes yacy specific metadata but still validates against PUBLIC DTD apfelmaennchen 2010-11-09 17:01:31 +00:00
  • d62e449a11 * fix FilterEngine, forgot comparison operator f1ori 2010-11-08 09:37:44 +00:00
  • cdd65aca71 update to ymarks - get_xbel.xml is almost working - started ymark api documentation info.html apfelmaennchen 2010-11-07 20:03:01 +00:00
  • 808edffaf6 ymarks - some refactoring - working xbel and html import (/api/ymarks/test_import.html) - working treeview (/api/ymarks/test_treeview.html) apfelmaennchen 2010-11-06 20:26:13 +00:00
  • 2c539b514a * add domaincheck (local/global/domainlist) to urlcleaner f1ori 2010-11-06 16:50:33 +00:00
  • 442bebca2b * %0 does not belong to the IPv6-Address -> entry does not work on some systems f1ori 2010-11-06 15:09:28 +00:00
  • 9fc940aa35 release 0.99 0.99 orbiter 2010-11-05 13:20:51 +00:00
  • 117fc86b3d fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199 orbiter 2010-11-05 13:19:37 +00:00
  • 441fbc26e2 security patch for WeakPriorityBlockingQueue (produced a deadlock) orbiter 2010-11-05 09:38:31 +00:00
  • 5dcb838293 - removed thread overhead when calling dns services - fixed localsearch (changed it by accident) orbiter 2010-11-05 00:29:32 +00:00
  • 4c50d3428e smaller file size for array stacks to support smaller deletion sizes orbiter 2010-11-04 13:29:19 +00:00
  • 09badc697b - low-memory patch for crawler orbiter 2010-11-04 13:26:27 +00:00
  • 6ac4f8142e * allow proxy requests from localhost via ipv6 (%0 does not belong to the address) f1ori 2010-11-04 10:52:54 +00:00
  • 274d5b3a87 de.lng: Added translation string missed in SVN 7301 ConfigHTCache_p.html: Added missing id for label pointing to actualCacheSize in span tag for XHTML 1.0 Strictness mikeworks 2010-11-04 07:30:00 +00:00
  • 9239ac1e56 de.lng: Added translation for new page http://localhost:8080/ConfigHTCache_p.html and old one http://localhost:8080/IndexControlRWIs_p.html ConfigHTCache_p.html: Removed additional </form> and changed title text mikeworks 2010-11-04 00:53:25 +00:00
  • becc463d8a enhanced did-you-mean orbiter 2010-11-04 00:25:19 +00:00
  • 43586a2ace an update to ymarks (please test if you wish): - import HTML (e.g. FF export) via /api/ymarks/import.html - view your import via /api/ymarks/test.html - get an xml list via /api/ymarks/get_ymark_list.xml?tags=&folders= - delete bookmark tables via standard interface /Tables_p.html it is still very experimental!! apfelmaennchen 2010-11-03 22:52:03 +00:00
  • 93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113 fixed a concurrent modification exception during search and a time-out problem orbiter 2010-11-03 20:58:50 +00:00
  • 04932dc268 added rdf data structure for rss feeds orbiter 2010-11-03 20:06:23 +00:00
  • 84f2953cd8 fix for rss loader / rss type recognition orbiter 2010-11-03 19:58:01 +00:00
  • 4c72885cba added a sitemap entry parser and loader for sitemaps (a recursion if a sitemap refers to another sitemap) orbiter 2010-11-03 19:48:33 +00:00
  • 790e0b1894 - enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded - added option for details of deletion - added deletion to new ConfigHTCache_p servlet orbiter 2010-11-03 18:31:36 +00:00
  • 3e303db1fa added search suggestions in XML format. This also supports CORS orbiter 2010-11-03 09:20:36 +00:00
  • f5324b27f2 more updates to the new bookmarks (ymarks).... - split YMarkTables and YMarkIndex in two different classes - HTML import is working properly - XBEL import is still broken apfelmaennchen 2010-11-03 06:47:02 +00:00
  • 445619f3ec added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration. orbiter 2010-11-02 23:57:11 +00:00
  • 85c65475fa small but important correction of last commit @ HTTPClient (if there is a response it really should be taken to its end) sixcooler 2010-11-02 21:18:44 +00:00
  • acd93b1b31 * add failsafe mechanism to domainlist retrieval: the domainlist is saved locally; if none of the given urls in network.unit.domainlist could be retrieved, the file from the last boot is used instead f1ori 2010-11-02 17:57:48 +00:00
  • 70c95608d4 Added CORS Access header for yacysearch.rss output used some of the recommendations from Copro: http://forum.yacy-websuche.de/viewtopic.php?p=21015#p21015 Original Request: http://forum.yacy-websuche.de/viewtopic.php?p=20829#p20829 orbiter 2010-11-02 16:28:40 +00:00
  • 917d715374 lulabad found his signature orbiter 2010-11-01 23:32:20 +00:00
  • 18729351e7 upnp: hint for wrongly detected local ip address lotus 2010-10-31 20:05:59 +00:00
  • def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists) * crawler and search allow only urls matching one in domainlist (if list is provided) * this may be useful to prevent dedicated networks from being "polluted" * FilterEngine is an improved Blacklist object, Blacklist may inherit from FilterEngine in the future f1ori 2010-10-30 14:44:33 +00:00
  • 2db6ca8104 - do not allow user to disable java installation - java version update - build cleanup lotus 2010-10-30 08:49:39 +00:00
  • 482127e777 removed release key from location 2 because the signature of that source can not be verified. But the source is ok. orbiter 2010-10-29 09:22:56 +00:00
  • ac6b503adf untar files without gzip decompression even if the file has gz extension. this is done when the decompression fails. decompressed gzip files with gz extension may appear if the server sets a gzip compression header orbiter 2010-10-28 23:12:33 +00:00
  • efe0667fdd more new bookmark (ymarks) code with experimental html and xbel import apfelmaennchen 2010-10-28 15:24:15 +00:00
  • caabebf9be Fixed spelling mistake omiting -> omitting in debug messages in ConfigUpdate_p.java and Switchboard.java mikeworks 2010-10-28 04:03:11 +00:00
  • fb92f9ae8e added mime type image/jpeg (image/jpg is wrong but it is left here because it does not harm and this error also exists in configuration of web servers) see also: http://forum.yacy-websuche.de/viewtopic.php?p=21129#p21129 orbiter 2010-10-27 21:53:11 +00:00
  • 155d556568 - better memory protection - more logging - little bit of refactoring orbiter 2010-10-27 13:21:18 +00:00
  • a16e5e7a63 ConfigPortal.html: Added safe space character because empty <dt> produces a warning in HTML validator de.lng: Added translation for added ConfigPortal selection for target selection and WikiHelp changes mikeworks 2010-10-26 21:14:37 +00:00
  • 7d8de34778 * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) f1ori 2010-10-26 16:10:20 +00:00
  • 25a8e55bc9 more logging about bad seeds orbiter 2010-10-26 15:00:22 +00:00
  • 959b8c6fa0 - allow greater seed size - more logging for bad seeds orbiter 2010-10-26 14:40:19 +00:00
  • e103419a56 - removed <3 peers barrier for peer ping feedback - more logging orbiter 2010-10-26 13:08:09 +00:00
  • d0e6c03b51 some updates to the new bookmark code... apfelmaennchen 2010-10-25 22:44:05 +00:00
  • facfd204e9 added a parent configuration option. see /ConfigPortal.html requested here: http://forum.yacy-websuche.de/viewtopic.php?p=21099#p21099 orbiter 2010-10-25 22:16:07 +00:00
  • e3964f2c31 better catch of network definition load error; continue with secondary network load definition location orbiter 2010-10-25 09:20:45 +00:00
  • 7adfe4a1c1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=35#p21092 apfelmaennchen 2010-10-25 08:12:04 +00:00
  • b8d6784410 *) WikiCode help page update low012 2010-10-25 00:30:10 +00:00
  • 65a0381f76 *) cleaning up code (still not done) low012 2010-10-24 23:57:07 +00:00
  • e3e3b49d52 - enhanced main release recognition - yacybot user agent now includes the yacy network name (not the peer name!) - refactoring and clean-up (mostly turned tab into spaces) orbiter 2010-10-24 21:43:01 +00:00
  • 9c94ebdee4 small changes to new bookmark code... apfelmaennchen 2010-10-22 13:14:09 +00:00
  • 244b56e9d3 an update to the new bookmark code... apfelmaennchen 2010-10-21 19:18:17 +00:00
  • 96e625c541 AccessTracker: Fixed table output while no results are displayed --> all of AccessTracker should now validate XHTML 1.0 Strict, also tfoot must be defined before tbody Connections_p: Added static protocol "http" for incoming connections because only http pool is queried and -UNRESOLVED PATTERN- is not really useful de.lng: Minor changes and removed one old line mikeworks 2010-10-19 03:26:18 +00:00
  • dc40f51b8d *) added headlines as proposed by Vega *) <pre> will be displayed monospaced in wiki and blog again *) bugfix for <pre> spanning multiple lines *) replaced deprecated <s> tag with <span> equivalent low012 2010-10-18 23:00:43 +00:00
  • f035f257da added some more bookmark code... apfelmaennchen 2010-10-18 21:09:41 +00:00
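
The slashtags described in commit cc6499bf8d (/date, /near and /language/<lang>) can be illustrated with a small query splitter. This is only a sketch of the idea, written in Python and not taken from YaCy's Java sources; the function name `parse_query`, the modifier dictionary and the two-letter restriction on the language code are assumptions made for the illustration.

```python
import re

# Hypothetical pattern: a two-letter language code after "/language/".
# The two-letter limit is an assumption, not something the commit states.
LANGUAGE_TAG = re.compile(r"^/language/([a-z]{2})$")


def parse_query(query):
    """Split a query into plain search words and slashtag modifiers.

    Illustrative only; this is not YaCy's actual implementation.
    """
    words = []
    modifiers = {"date": False, "near": False, "language": None}
    for token in query.split():
        language = LANGUAGE_TAG.match(token)
        if token == "/date":
            # order results by date, most recent first
            modifiers["date"] = True
        elif token == "/near":
            # prefer results where the search words appear close together
            modifiers["near"] = True
        elif language:
            # sort so that the wanted language comes first
            modifiers["language"] = language.group(1)
        else:
            words.append(token)
    return words, modifiers


if __name__ == "__main__":
    print(parse_query("yacy peer /language/de /date"))
```

Tokens that do not match one of the three slashtags are kept as ordinary search words, so an unknown slashtag is treated as a search term in this sketch.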