Commit Graph

  • 97615af406 *) Restructuring of YaCy SOAP services - general functions moved to abstract service class - service class splitted into SearchService, CrawlService, StatusService *) Bugfix for SOAP search services - Attention: some xml tages where renamed See: http://www.yacy-forum.de/viewtopic.php?p=25877 *) New SOAP service function urlInfo to view the parsed content of an URL See: http://www.yacy-forum.de/viewtopic.php?p=25869 theli 2006-09-26 14:47:44 +00:00
  • 241b881560 *) Redesign of YaCy SOAP handler - should be more fail-safe now - better handling of compressed request bodies - better handling of persistent connections - better handling of AxisFaults theli 2006-09-26 12:24:40 +00:00
  • 009a33170b *) Content-Location header added theli 2006-09-26 04:32:01 +00:00
  • b0cab1e731 *)Adapted surftipps to use common 0/1 parameters *)Added translation of WatchCrawler.html *)Changed format of German translation. Formal description will probably follow. rramthun 2006-09-24 20:15:28 +00:00
  • f17a91313e *)Updated phosphor.css for XHTML *)New grey skin Thanks to Philipp Redeker! rramthun 2006-09-24 17:55:21 +00:00
  • 1aa07a52cd *) Bugfix for UnsupportedEncodingException if the media type contains multiple parameters See: http://www.yacy-forum.de/viewtopic.php?p=25832#25826 theli 2006-09-24 15:50:51 +00:00
  • 4922ab8920 try to fix a nullpointer on snippet generation allo 2006-09-22 22:51:44 +00:00
  • d8fde14c3a Make maximum number of words in DHT-In cache configurable at runtime hermens 2006-09-22 12:44:58 +00:00
  • 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache See: http://www.yacy-forum.de/viewtopic.php?p=25748 theli 2006-09-22 11:55:28 +00:00
  • 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1 - better mimetype detection for rss feeds - better mimetype detection for odt documents (less memory consuming) - two new detector classes implementing MagicDetector interface of jmimemagic theli 2006-09-22 11:40:46 +00:00
  • 3f5a4153a0 Make Peers more receptible to transferred indexes - Set MaxWordCount for dhtInCache to indexDistribution.dhtReceiptLimit so that the inCache gets flushed when the limit is passed - Modify flushCacheSome to flush enough words to get below MaxWordCount immediately hermens 2006-09-22 10:58:58 +00:00
  • 740696f6c3 *) few fixes XHTML-validation ( there is still much to do) hydrox 2006-09-22 08:04:24 +00:00
  • 57415b6889 *) Bugfix for surftipps UTF-8 problem See: http://www.yacy-forum.de/viewtopic.php?t=2864 theli 2006-09-22 05:40:29 +00:00
  • 706572f18d *) Bugfix for ArithmeticException caused by setting max crawling thread count was to 0 See: http://www.yacy-forum.de/viewtopic.php?t=2862 theli 2006-09-22 04:29:31 +00:00
  • 2d3b96eeba bugfixes for surftipps - added missing authorization check for votes - second vote on same entry was possible after complete publishing of current vote orbiter 2006-09-21 21:37:40 +00:00
  • 85f3617835 *) moved HTML from class-file to template-file (please check if it is valid HTML) hydrox 2006-09-21 20:36:46 +00:00
  • 9434dba8f2 *) corrected title of IndexCleaner_p.html hydrox 2006-09-21 07:26:41 +00:00
  • b0a4fcce8c fix from theli allo 2006-09-20 18:03:24 +00:00
  • b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher) *) better logging of parser failures *) simplified usage of plasmaparser through switchboard *) restructuring of crawler - crawler now returns an error message if it is used in sync mode (e.g. by snippet fetcher) *) snippet-fetcher: more verbose error messages *) serverByteBuffer.java: adding new function append(String,encoding) *) serverFileUtils.java: adding functions to copy only a given number of bytes between streams theli 2006-09-20 12:25:07 +00:00
  • aa38721cf6 new features for surftipps - new generation with less memory - removal of doubles - positive votes can generate entries without original news (so they can live on) - link deletion on search results are now also negative votes for surftipps (but they may rarely hit any news) orbiter 2006-09-20 12:01:51 +00:00
  • 64b2ef5aae *) Trying to bugfix shutdown problem theli 2006-09-20 10:13:23 +00:00
  • e03427871e enhanced surftipps: - added switchh to show or hide surftipps - more news contribute to surftipps - added voting system for surftipps orbiter 2006-09-20 07:17:41 +00:00
  • e745b63c77 *) Bugfix for different behavior of indexDistributeWhileCrawling to other checkboxes on IndexControl_p.html See: http://www.yacy-forum.de/viewtopic.php?t=2849 theli 2006-09-20 04:44:56 +00:00
  • 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread See: http://www.yacy-forum.de/viewtopic.php?p=25729 theli 2006-09-20 04:36:25 +00:00
  • 42173462f5 rename cutUrlText to shortenURLString; other little things; borg-0300 2006-09-19 20:47:45 +00:00
  • af1d89e381 check url == null added; borg-0300 2006-09-19 20:12:26 +00:00
  • cc667b0aa5 *) htmlFilterContentScraper.java: adding support for link tag theli 2006-09-19 16:13:13 +00:00
  • 16ba5d1b46 topwords: only [a-z] words, quality is better; blank removes; properties added; borg-0300 2006-09-19 10:44:45 +00:00
  • 66a58502df *) configure logging filehandler to use UTF-8 for logging messages theli 2006-09-19 05:39:50 +00:00
  • 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui theli 2006-09-19 05:19:41 +00:00
  • cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the default InputStream Buffer size. theli 2006-09-19 04:58:34 +00:00
  • f18304ddd3 unused/not needed imports removes; properties added; borg-0300 2006-09-18 22:21:18 +00:00
  • ec031eb993 first version of surftipps see http://localhost:8080/index.html orbiter 2006-09-18 20:14:21 +00:00
  • b174fbd0ca "import ...*" removed; properties added; borg-0300 2006-09-18 18:31:27 +00:00
  • 807756150e patch for strange bug reported by email orbiter 2006-09-18 16:50:31 +00:00
  • 5c6251bced *) some improvements for extended html document charset support - new class htmlFilterInputStream.java which allows to pre-analyze the html header to extract the charset meta data. This is only enabled for the crawler at the moment. Integration into proxy needs more testing. - adding eventlisterner interfaces to the htmlscraper to allow other classes to get informed about detected tags (used by the htmlFilterInputStream.java) theli 2006-09-18 15:36:04 +00:00
  • 33f0f703c0 *) reinserting type cast again theli 2006-09-18 13:21:12 +00:00
  • 8c11a543dc fixed line ending coding orbiter 2006-09-18 13:17:31 +00:00
  • b690597275 *) adding casts to avoid compatibility problems between java 1.4 and java 1.5 writer class usage theli 2006-09-18 12:17:51 +00:00
  • 5afb0cbce8 *) setting default charset (for unkown documents) to iso-8859-1 *) theli 2006-09-18 11:39:06 +00:00
  • f453c14b5d removed unreacheable catch blocks and unused imports orbiter 2006-09-18 11:23:58 +00:00
  • ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class theli 2006-09-18 11:04:16 +00:00
  • 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets - serverFileUtils.java: -- adding methods to copy from stream to writer and readers to writers -- moving httpc writeX methods into serverFileUtils class - serverCharBuffer.java: removing inheritance from Writer class - replacing htmlFilterOutputStream by htmlFilterWriter class which handles content as char stream - htmlFilterContentTransformer.java: deactivating getText mode (still needs to be migrated to use char streams instead of byte streams) - changes in several classes to use htmlFilterWriter instead of htmlFilterOutputStream - changes in Scraper and Transformer classes to operate on chars instead of bytes - httpdProxyHandler.java: bugfix. clientTimeout setting was missing in config file theli 2006-09-18 10:12:11 +00:00
  • fc594e8eda *) adding httpContentLengthInputStream.java class to allow reading of http response bodies until EOF even if a persistent connection is used *) httpdByteCountInputStream.java: adding skip method *) httpHeader.java: adding getCharacterEncoding function theli 2006-09-18 10:00:28 +00:00
  • cd636eb00e *) Fix for the fix... low012 2006-09-18 01:24:26 +00:00
  • f9a5b55a9e *) Fixed bug described in http://www.yacy-forum.de/viewtopic.php?p=25448#25448 low012 2006-09-18 01:19:54 +00:00
  • 3aac5b26da - added automatic tag generation when a web page from the search results is added - added new image 'B' in front of search results for bookmark generation - added news generation when a public bookmark is added - the '+' in front of search results has new meaning: positive rating for that result - added news generation when a '+' is hit orbiter 2006-09-18 00:37:02 +00:00
  • 8a30c5343d *) Fixed bug where exclamation marks could get lost between [=...=] and <pre>...</pre> low012 2006-09-17 23:42:36 +00:00
  • d8f4b17e31 *) Hopefully fixed bug described in http://www.yacy-forum.de/viewtopic.php?t=2825. low012 2006-09-17 22:57:10 +00:00
  • 2d9496577f Removed double labels for forms in Blacklist_p.html michitux 2006-09-17 08:07:21 +00:00
  • aa46269eff Less margin/padding for dls (e.g. in Messages) michitux 2006-09-17 07:42:07 +00:00
  • 567c40f5f0 Bookmark/delete-links now visible when mouse is over the searchresult, in standard-compliant browsers with css, in Microsoft Internet Explorer via JavaScript michitux 2006-09-16 16:56:22 +00:00
  • 0e84a969d6 *) Bugfix for serverCharBuffer read from file operation theli 2006-09-16 13:11:32 +00:00
  • 90ef19d778 *) first version of a serverCharBuffer theli 2006-09-16 12:56:03 +00:00
  • d374ef2bbe bugfix for tryRemoveURLs orbiter 2006-09-16 00:34:34 +00:00
  • f644a1c3a7 better evaluation of index abstracts orbiter 2006-09-16 00:07:09 +00:00
  • 1b48473bc5 bugfix to utf8 recognition orbiter 2006-09-15 23:55:06 +00:00
  • 90f7241b59 serverByteBuffer.trim() can now recognize utf-8 characters orbiter 2006-09-15 23:52:26 +00:00
  • 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611 allo 2006-09-15 17:48:41 +00:00
  • 20e1754379 Various fixes for the languages rramthun 2006-09-15 16:02:37 +00:00
  • e34d9b3fec *) charset aware headlines (after the serverByteBuffer.trim problem is solved) theli 2006-09-15 15:07:35 +00:00
  • 8115ac47b5 *) charset aware metadata parsing theli 2006-09-15 15:01:25 +00:00
  • 3ac30bdf22 *) some todo markers added for additional charset support theli 2006-09-15 14:49:43 +00:00
  • d54144a4e3 fixed bad snippet behavior (hopefully) orbiter 2006-09-15 14:17:18 +00:00
  • 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title theli 2006-09-15 14:05:28 +00:00
  • 5015e780c2 - simplified watchCrawler code - changed display of watchCrawler slightly orbiter 2006-09-15 13:54:10 +00:00
  • 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser) *) htmlFilterContentScraper.java: using proper charset for document title *) serverByteBuffer.java: adding new toString which allows to specify the charset for byte encoding theli 2006-09-15 13:18:12 +00:00
  • c5d3020941 *) better errorhandling for last commit theli 2006-09-15 12:56:01 +00:00
  • d0a5a53789 *) changes needed for multi-language support - parsers may need to know the charset of the byte stream theli 2006-09-15 12:52:46 +00:00
  • 31d6cdea53 WatchCrawler.html now valid xhtml, added the class TableCellActive to default skin, please update your skins (sorry, I removed it before because I hadn't seen it in any html-file) michitux 2006-09-15 11:50:25 +00:00
  • d82875c72b removed removal of 'funny symbols' that may have caused utf-8 problems orbiter 2006-09-15 09:08:15 +00:00
  • 26ab1fa885 fixed null pointer exception See http://www.yacy-forum.de/viewtopic.php?p=25598#25598 orbiter 2006-09-15 08:50:16 +00:00
  • 9bed90f8dc bugfix in js allo 2006-09-15 06:33:22 +00:00
  • b0e8ff6eda *) some TODO makers for UTF-8 problem theli 2006-09-15 05:31:30 +00:00
  • b5904705ab *) Bugfix for "determineRevisionNr: build.xml:98: SVN entries file does not exist" bug See: http://www.yacy-forum.de/viewtopic.php?t=2824 theli 2006-09-15 04:38:36 +00:00
  • c42b011648 added watch crawler to menu orbiter 2006-09-15 01:09:34 +00:00
  • 41e27b85b7 fix for crawler condition orbiter 2006-09-15 00:38:45 +00:00
  • 92157febcd Bugfix for Blacklist_p.html: Adding of new patterns possible again michitux 2006-09-14 15:20:32 +00:00
  • 0ee7e45413 bugfix for merge method (caused by bad refactoring) see http://www.yacy-forum.de/viewtopic.php?p=25529#25529 orbiter 2006-09-14 10:30:25 +00:00
  • 40965e183e bugfix for minimizeurldb and urldbcleanup see http://www.yacy-forum.de/viewtopic.php?p=25539#25539 orbiter 2006-09-14 10:12:41 +00:00
  • 5c2f30eaca adjustments to dhtInCache write orbiter 2006-09-14 09:28:17 +00:00
  • 9ecf7f0da2 *) some TODO makers for UTF-8 problem theli 2006-09-14 05:37:46 +00:00
  • e2f8339827 *) some bugfixes for UTF-8 related problems theli 2006-09-14 05:16:36 +00:00
  • f4af607b79 *) just some typos low012 2006-09-14 01:11:49 +00:00
  • e03740c306 small fix for last commit orbiter 2006-09-14 00:57:41 +00:00
  • c89d8142bb replaced old 'kCache' by a full-controlled cache there are now two full-controlled caches for incoming indexes: - dhtIn - dhtOut during indexing, all indexes that shall not be transported to remote peers because they belong to the own peer are stored to dhtIn. It is furthermore ensured that received indexes are not again transmitted to other peers directly. They may, however be transmitted later if the network grows. orbiter 2006-09-14 00:51:02 +00:00
  • 6e2907135a bugfixes for remote search server part orbiter 2006-09-13 22:19:34 +00:00
  • 2c6f2a1f74 First language fixes for new XHTML-layout rramthun 2006-09-13 20:03:15 +00:00
  • cf9884e22b first attempt to implement a secondary search this is a set of search processes that shall enrich search results with specialized requests to realize a combination of search results from different peers. orbiter 2006-09-13 17:13:28 +00:00
  • 2a06ce5538 *) next bugfix for UTF-8 - Sending UFT-8 messages to other peers did not work - httpd.java: minor corrections for UTF-8 theli 2006-09-13 15:47:56 +00:00
  • bdc51591ae *) UTF-8 Bug solved (hopefully) See: http://www.yacy-forum.de/viewtopic.php?p=25522 theli 2006-09-13 14:48:58 +00:00
  • 13d0cff257 right dhtml. allo 2006-09-13 14:02:34 +00:00
  • ef751b9d33 *) removing all string operations from the template engine - engine should fully operate on bytes now theli 2006-09-13 13:56:10 +00:00
  • 7ef80c1026 more debugging orbiter 2006-09-13 13:52:46 +00:00
  • dfc0ac1958 syntax error fixes orbiter 2006-09-13 12:02:02 +00:00
  • 6e03f61daa fix for highlighting searched words in snippets allo 2006-09-13 11:26:26 +00:00
  • b251076e64 avoid ConcurrentModificationException orbiter 2006-09-13 10:36:18 +00:00
  • 99699a4d70 *) bugfix for new svn revision number ant task theli 2006-09-13 10:19:22 +00:00
  • da9f67a56d *) bugfix needed because of new svn version 1.4 (new .svn/entries file format) - lib/svnRevNr.jar: customized ant task to read the revision number out of the .svn/entries file - build.xml: calling new ant task theli 2006-09-13 09:59:32 +00:00
  • 3bbe6a77da Smaller font-size for tables in PerformanceQueues_p.html and PerformanceMemory_p.html michitux 2006-09-12 21:41:16 +00:00