Commit Graph

  • 3f234a483d Remove samedom/samesite/samehost Ai Lin Chia 2016-06-07 12:06:18 +02:00
  • 32e5c5e2f4 (minor optimization) use suitable default hash table size of html entities Ivan Skytte Jørgensen 2016-06-07 13:28:33 +02:00
  • e0f49d9a78 Use ttps://www.w3.org/TR/html5-author/entities.json and html entity source Ivan Skytte Jørgensen 2016-06-07 12:47:08 +02:00
  • f00fd4ba8b Support more html entities (from https://www.w3.org/TR/2014/REC-html5-20141028/entities.json) Ivan Skytte Jørgensen 2016-06-07 12:25:24 +02:00
  • 19e71f6467 Handle unicode codepoints >= U+10000 correctly Ivan Skytte Jørgensen 2016-06-07 12:19:17 +02:00
  • 470807ca48 Make fwd and def of functions consistent Ivan Skytte Jørgensen 2016-06-06 18:04:51 +02:00
  • 1edcc81d39 More constness in Inlink class Ivan Skytte Jørgensen 2016-06-06 13:07:15 +02:00
  • a78c0a69a0 bit more constness Ivan Skytte Jørgensen 2016-06-03 17:18:49 +02:00
  • 785b23f239 Make makeLinkInfo() static Ivan Skytte Jørgensen 2016-06-03 17:06:00 +02:00
  • dbfd6c2e68 Remove support for isparentpingserver from url filters. Ai Lin Chia 2016-06-03 16:25:35 +02:00
  • e10bd93482 Remove support for isparentsitemap from url filters. Ai Lin Chia 2016-06-03 16:01:47 +02:00
  • 9979f9db82 Remove support for isparentrss from url filters. (reduce number of spiderdb duplicates) Remove isparentpermalink from spiderdb Ai Lin Chia 2016-06-03 15:38:36 +02:00
  • 3bfffa103b Removed unused qbuf parameter from getLinkInfo() Ivan Skytte Jørgensen 2016-06-03 15:36:04 +02:00
  • ba2f1689fc Change info log back to debug Ai Lin Chia 2016-06-03 14:12:59 +02:00
  • 141f1030b8 Remove always true variable usepartfile Ai Lin Chia 2016-06-03 14:12:39 +02:00
  • ad37b54540 Fix Parms.cpp so that we can set some rdb controls for test setup Ai Lin Chia 2016-06-03 12:11:04 +02:00
  • 16ed5f9d63 Fix use of uninitialized data in Collectionrec (filters) Ivan Skytte Jørgensen 2016-06-03 12:06:28 +02:00
  • 53a7a22b0f Code style changes Ai Lin Chia 2016-06-03 10:39:20 +02:00
  • a9237d04fe Disable part2 of adult words in a better way Ivan Skytte Jørgensen 2016-06-02 14:39:40 +02:00
  • 53a664f2eb const char* in RdbBuckets Ivan Skytte Jørgensen 2016-06-02 14:25:16 +02:00
  • 46ca63221b Fix <= test on boolean Ivan Skytte Jørgensen 2016-06-02 14:20:50 +02:00
  • df87e6eedd bugfix RdbBuckets::fastLoad() Ivan Skytte Jørgensen 2016-06-02 13:29:05 +02:00
  • d4ceb566c5 Add some test for Url::GetSubPathLen Ai Lin Chia 2016-06-02 13:07:03 +02:00
  • fd7359f77d Removred unused HttpRequest::isMSIE() Ivan Skytte Jørgensen 2016-06-02 12:49:28 +02:00
  • 71dca80eeb more constness in HttpRequest Ivan Skytte Jørgensen 2016-06-02 12:47:23 +02:00
  • 1572775264 Check correctly for number parsing errors Ivan Skytte Jørgensen 2016-06-02 12:35:03 +02:00
  • 4059eb7ebd Fix conversion from string literal to 'char *' for PingServer Ai Lin Chia 2016-06-01 15:57:35 +02:00
  • 601634b74f Fix warning no previous extern declaration for non-static variable Ai Lin Chia 2016-06-01 15:52:36 +02:00
  • 117f571393 Fix conversion from string literal to 'char * for Parms Ai Lin Chia 2016-06-01 15:36:31 +02:00
  • d2e5db7f1c Fix warning class 'StateControl' was previously declared as a struct Ai Lin Chia 2016-06-01 15:36:05 +02:00
  • a908e4a6e9 Add constness to getHostFast Ai Lin Chia 2016-06-01 15:35:25 +02:00
  • ce7299aa2b Remove always false parameter Ai Lin Chia 2016-06-01 15:00:43 +02:00
  • 59b5146607 Remove commented out code Ai Lin Chia 2016-06-01 11:10:34 +02:00
  • d8ece42394 Add/change todo comments Ai Lin Chia 2016-06-01 11:10:06 +02:00
  • e369c8aaaa Changed UDP peek+read to just read Ivan Skytte Jørgensen 2016-05-31 14:54:23 +02:00
  • e4d192eab6 Don't pass NULL into strncmp Ai Lin Chia 2016-05-31 12:42:25 +02:00
  • 5d24124a78 Fix UrlParser logic to be more similar to Url Ai Lin Chia 2016-05-31 12:04:33 +02:00
  • aba543a168 Fix conversion from string literal to 'char *' for XmlDoc Ai Lin Chia 2016-05-31 11:38:52 +02:00
  • 023673fab0 Fix conversion from string literal to 'char *' for Conf, Msg12, Msg3, MsgC, Parms, PingServer, Rdb, RdbBase, RdbBuckets, TcpServer, XmlDoc Ai Lin Chia 2016-05-31 10:46:35 +02:00
  • 70ef08c60b Fix conversion from string literal to 'char *' for Unicode Ai Lin Chia 2016-05-30 17:14:01 +02:00
  • cfb695ea7e Fix conversion from string literal to 'char *' for SearchInput Ai Lin Chia 2016-05-30 17:13:40 +02:00
  • 8c04cf0e56 Fix conversion from string literal to 'char *' for RdbDump Ai Lin Chia 2016-05-30 17:13:26 +02:00
  • c5d623d608 Fix conversion from string literal to 'char *' for HttpServer Ai Lin Chia 2016-05-30 17:12:48 +02:00
  • 3c7440f769 Fix conversion from string literal to 'char *' for Parms Ai Lin Chia 2016-05-30 17:12:00 +02:00
  • 9fa82f9197 Fix conversion from string literal to 'char *' for Msg3 Ai Lin Chia 2016-05-30 17:11:40 +02:00
  • 9f2c47503a Fix conversion from string literal to 'char *' for RdbBase Ai Lin Chia 2016-05-30 17:11:26 +02:00
  • 489f09f602 Fix conversion from string literal to 'char *' for Posdb Ai Lin Chia 2016-05-30 17:07:45 +02:00
  • 3b08a29043 Fix conversion from string literal to 'char *' for PageLogView Ai Lin Chia 2016-05-30 17:07:29 +02:00
  • 9682469635 Fix conversion from string literal to 'char *' for MsgC Ai Lin Chia 2016-05-30 17:07:19 +02:00
  • acd0fe5ee2 Fix conversion from string literal to 'char *' for PageStatsdb Ai Lin Chia 2016-05-30 17:07:11 +02:00
  • 8dba617b75 Partial fix conversion from string literal to 'char *' for Tagdb Ai Lin Chia 2016-05-30 17:06:55 +02:00
  • c83be13968 Fix conversion from string literal to 'char *' for RdbList Ai Lin Chia 2016-05-30 17:06:43 +02:00
  • 6b2a1231e2 Fix conversion from string literal to 'char *' for Mem Ai Lin Chia 2016-05-30 17:06:31 +02:00
  • 90c21f8120 Fix conversion from string literal to 'char *' for Dns Ai Lin Chia 2016-05-30 17:06:11 +02:00
  • 89910ffdca Fix conversion from string literal to 'char *' for BigFile Ai Lin Chia 2016-05-30 17:05:52 +02:00
  • 6e2abdd9da Fix conversion from string literal to 'char *' for Linkdb Ai Lin Chia 2016-05-30 16:47:30 +02:00
  • 7aff240f44 Fix conversion from string literal to 'char *' for main Ai Lin Chia 2016-05-30 16:34:34 +02:00
  • d1ca951e57 Fix conversion from string literal to 'char *' for Statsdb Ai Lin Chia 2016-05-30 16:23:18 +02:00
  • 190d68512d Fix conversion from string literal to 'char *' for SiteGetter Ai Lin Chia 2016-05-30 16:23:03 +02:00
  • dd51a0d8bd Fix conversion from string literal to 'char *' for Repair Ai Lin Chia 2016-05-30 16:22:28 +02:00
  • e6288534d8 Fix conversion from string literal to 'char *' for Query Ai Lin Chia 2016-05-30 16:22:13 +02:00
  • a64abad471 Fix conversion from string literal to 'char *' for Wiktionary Ai Lin Chia 2016-05-30 16:13:15 +02:00
  • 1c87cf1b82 Fix conversion from string literal to 'char *' for SpiderProxy Ai Lin Chia 2016-05-30 16:12:50 +02:00
  • 58ca00ecd2 Fix conversion from string literal to 'char *' for Sections Ai Lin Chia 2016-05-30 16:12:42 +02:00
  • ceae302fec Fix conversion from string literal to 'char *' for Msg40 & Msg51 Ai Lin Chia 2016-05-30 16:12:21 +02:00
  • 0499e9d53b Fix conversion from string literal to 'char *' for PageCrawlBot Ai Lin Chia 2016-05-30 16:01:10 +02:00
  • cd1fb98555 Fix conversion from string literal to 'char *' for qa Ai Lin Chia 2016-05-30 15:53:55 +02:00
  • e4a5cba1ca Fix conversion from string literal to 'char *' for Spider Ai Lin Chia 2016-05-30 15:45:25 +02:00
  • b3248b7556 Fix conversion from string literal to 'char *' for TcpServer Ai Lin Chia 2016-05-30 15:31:03 +02:00
  • feafdf1dca Partial fix conversion from string literal to 'char *' for XmlDoc Ai Lin Chia 2016-05-30 15:19:21 +02:00
  • c576088427 Fix conversion from string literal to 'char *' for PageAddColl Ai Lin Chia 2016-05-30 15:13:58 +02:00
  • c89c23e645 Fix conversion from string literal to 'char *' for PageResults Ai Lin Chia 2016-05-30 15:12:44 +02:00
  • 9c92e7d058 Fix conversion from string literal to 'char *' for Unicode Ai Lin Chia 2016-05-30 15:03:59 +02:00
  • 4f0d77b96c Fix conversion from string literal to 'char *' for StopWords Ai Lin Chia 2016-05-30 14:38:32 +02:00
  • 7093ea212f Fix conversion from string literal to 'char *' for PageBasic Ai Lin Chia 2016-05-30 14:38:22 +02:00
  • 04f18e89f3 Fix conversion from string literal to 'char *' for Entities Ai Lin Chia 2016-05-30 14:38:04 +02:00
  • b8dcb7f994 Fix conversion from string literal to 'char *' for CountryCode Ai Lin Chia 2016-05-30 14:37:51 +02:00
  • 6a763c49d3 Fix conversion from string literal to 'char *' for Abbreviations Ai Lin Chia 2016-05-30 14:37:34 +02:00
  • 4aff4e01ac Fix compilation warning Ai Lin Chia 2016-05-30 13:49:33 +02:00
  • b0bdb7ec1d fix static/inline forward declarations Ivan Skytte Jørgensen 2016-05-30 23:39:13 +02:00
  • b2e8d6644a Removed unused parameter sock from UdpSlot::readAck() Ivan Skytte Jørgensen 2016-05-30 17:51:09 +02:00
  • d2b63bd629 constness in Dns Ivan Skytte Jørgensen 2016-05-30 17:04:21 +02:00
  • 36ea23a68e constness in UCPropTable Ivan Skytte Jørgensen 2016-05-30 16:47:04 +02:00
  • be23f6c4ca fix spider detection of corrupted requests. fix deduping so it doesn't core on docid based spider requests. Matt Wells 2016-05-28 08:40:59 -07:00
  • 5686d6dbec fix dns lookup bug that was causing us to get incorrect ips sometimes. Matt Wells 2016-05-17 11:57:21 -07:00
  • c653c27f92 fix empty file bug again Matt Wells 2016-05-13 14:49:54 -07:00
  • fe94126ae3 trash files of length 0 that are holding up a merge. if we can't merge files we end up stockpiling them and things get slow fast. Enhance dedup in spider Matt Wells 2016-05-13 13:21:43 -07:00
  • 228fd7bd53 Fixes for new tlds. Zak Betz 2016-05-12 16:04:21 -06:00
  • 3fee63a2d7 Updated tld list with the most current list. Zak Betz 2016-05-04 11:17:51 -06:00
  • d50248d3e8 fix cores from XmlDoc::getLinkInfo1() returning -1 because of its call to getFirstIp() presumably. Matt Wells 2016-04-18 09:55:00 -07:00
  • 6f06f15a27 Merge c5de65a78a8795a36480d489adb4dc0eb3364675: getLinkInfo1() returning -1 Matt Wells 2016-04-17 18:50:23 -07:00
  • 5837b075a6 Merge 95a3a261db8241f1cc671680b5318b5e3bd986e9 Matt 2016-04-15 13:34:41 -06:00
  • 1959f796f3 Merge of cf7ec13de61aa7c7b80f6ea2f69a267ab7f908e4 (already done) Zak Betz 2016-03-29 12:41:34 -06:00
  • 0eef3939d9 forgotten commit Ivan Skytte Jørgensen 2016-05-30 13:37:55 +02:00
  • 6a63a2ce89 Add criteria for stripping session parameter. Add missing tests. Ai Lin Chia 2016-05-30 12:49:23 +02:00
  • e8fb2d0987 Move around UrlTest so that it's similar to the structure in Url.cpp Add UrlComponent::Validator test Ai Lin Chia 2016-05-27 18:19:35 +02:00
  • 2adc21655e Fix bug in UrlComponent where allow criteria is wrong. Fix code style to make it clearer. Ai Lin Chia 2016-05-27 18:13:45 +02:00
  • 4472cdcc3f Misc. cleanup in UnicodeProperties.h Ivan Skytte Jørgensen 2016-05-30 12:42:20 +02:00
  • 37c9db1233 constness in linkspam Ivan Skytte Jørgensen 2016-05-30 12:42:01 +02:00
  • 888947f006 Merge branch 'master' of github.com:privacore/open-source-search-engine Ivan Skytte Jørgensen 2016-05-30 11:06:38 +02:00