Commit Graph

  • 50c64f9369 fix printing of getInlineSectionVotingBuf() to be more accurate mwells 2014-07-09 15:44:41 -07:00
  • 05fcef9651 more vote infusion and squid proxy fixes. mwells 2014-07-09 14:57:58 -07:00
  • d4218e01d7 inject docs that come through our squid proxy mwells 2014-07-09 12:25:23 -07:00
  • d7b67f21e7 return error if we get CONNECT requests. we don't handle those because we can't cache them or inject the sectiondb voting info into their tags because they are encrypted from us. mwells 2014-07-09 11:06:46 -07:00
  • 0f9409235e some cleanups mwells 2014-07-09 10:41:38 -07:00
  • 62c920efd0 Merge branch 'testing' into diffbot-matt mwells 2014-07-09 10:23:27 -07:00
  • 205349cfbb added a clone coll page. for after creation cloning. mwells 2014-07-09 07:54:29 -07:00
  • 192b2ee393 added sendPageClone() entire page for cloning as well mwells 2014-07-09 07:33:14 -07:00
  • f7e7468e74 get clone working when adding new coll mwells 2014-07-09 07:13:24 -07:00
  • 0b64d7f0af show api/xml/json in serps mwells 2014-07-09 06:36:36 -07:00
  • a154f679d1 some setup for qaspider() mwells 2014-07-08 20:33:13 -07:00
  • 5ae476f34e print facets for each search result mwells 2014-07-08 19:38:54 -07:00
  • 1af75c5d88 send back facet field/value pairs in msg20reply mwells 2014-07-08 14:22:55 -07:00
  • 99872e9f72 Merge branch 'diffbot-testing' into testing mwells 2014-07-08 13:38:10 -07:00
  • 48d12eb147 application/xhtml+xml should be type html not xml otherwise we don't end up spidering the links Matt Wells 2014-07-08 13:34:53 -07:00
  • a09ba6261f fix ./gb installgb Matt Wells 2014-07-08 13:23:22 -07:00
  • a4273a1269 section voting markup updates mwells 2014-07-08 11:14:45 -07:00
  • 1e8f6ce474 forgot to specify coll for facet link mwells 2014-07-08 10:40:34 -07:00
  • e658ebc8f6 fix up sections page some more. useful for debugging sections stuff. mwells 2014-07-08 10:31:42 -07:00
  • 842d72b5db Merge branch 'testing' into diffbot-matt mwells 2014-07-08 09:58:54 -07:00
  • 5a557e765a added copyCollRec() function to clone setting of one coll to another. mwells 2014-07-08 07:57:49 -07:00
  • d7cc290a1f added a few new search parms that can be used to override collection defaults. hide all clustered results. max title len. max summary excerpt/line width. mwells 2014-07-08 07:01:51 -07:00
  • eb7d83cbad added &showimages=0 parm. also print image url. mwells 2014-07-08 06:26:42 -07:00
  • a7bddbcc0b return up to the first 3 h1 tags when &geth1tag=1 is specified for an xml or json feed. mwells 2014-07-07 21:01:07 -07:00
  • 445896e04c fix query reindex core Matt Wells 2014-07-07 19:11:01 -07:00
  • 67ba89dd11 added gbequalint: query operator for showing docs with a specific facet VALUE. mwells 2014-07-07 17:40:49 -07:00
  • c8567f8a24 sectioning stuff working halfway decent. still need to do docid-based stats perhaps. need to scroll to section hash when clicking the 'sections' link. mwells 2014-07-07 16:46:38 -07:00
  • a3ef40ccf5 pass qa test mwells 2014-07-07 12:42:30 -07:00
  • d9ae010371 shard gbfacetstr:gbxpathsitehash123456 terms by termid for speed. got them working again multicasting a msg 0x39 to the appropriate shard. set special msg39request flag for better performance for those guys. mwells 2014-07-07 12:32:27 -07:00
  • 6434e5cc04 Merge branch 'testing' into diffbot-matt mwells 2014-07-07 09:49:59 -07:00
  • 05065f7f8c treat http status 999 as forbidden. mwells 2014-07-07 09:46:24 -07:00
  • e22641997a fix geth1tag some more. fixed bad comment tag detection. was losing a good deal of some pages because of that. mwells 2014-07-07 08:20:21 -07:00
  • fed7b73b9f passing qa test again mwells 2014-07-06 22:06:33 -07:00
  • 38e64a6600 update qa loop mwells 2014-07-06 19:43:00 -07:00
  • dc6c97c59c basic qa tests running mwells 2014-07-06 18:53:05 -07:00
  • 4dee019107 qa fixes mwells 2014-07-06 16:47:04 -07:00
  • aeae6bb1a5 qa test updates mwells 2014-07-06 15:04:21 -07:00
  • 70e1eab935 more api updates mwells 2014-07-06 14:13:00 -07:00
  • 97ad9a62e0 support &addcoll= as well as &addColl= mwells 2014-07-06 12:09:41 -07:00
  • 574b3f9354 if netpbm pkg already installed use it. mwells 2014-07-06 09:54:28 -07:00
  • e4f43848d5 fix http://abc.com in sitelist some more. mwells 2014-07-06 08:31:27 -07:00
  • 43d0d636ee fix dmoz building. mwells 2014-07-05 22:20:15 -07:00
  • 6d425d0bdb use changelog in binary packages mwells 2014-07-05 18:51:49 -07:00
  • c8183399de makefile version bump mwells 2014-07-05 16:21:57 -07:00
  • d0345b7c75 fix respider link mwells 2014-07-05 16:19:16 -07:00
  • 183191520f only show h1tag if there mwells 2014-07-05 16:11:17 -07:00
  • 81a89f5975 added support for &geth1tag=1 for xml feeds. mwells 2014-07-05 16:08:48 -07:00
  • 1944b25beb parm fixes mwells 2014-07-05 15:00:16 -07:00
  • d47ffaaa41 fix widget bug mwells 2014-07-05 14:33:41 -07:00
  • af1f7e4590 do not launch spiders if in the middle of exiting. mwells 2014-07-05 14:29:46 -07:00
  • 41804611bb if logging to stderr then return err when trying to fetch logs. mwells 2014-07-05 14:16:33 -07:00
  • 4d98ec7407 fixed some parms bug i made mwells 2014-07-05 14:11:32 -07:00
  • 4059c84074 api updates mwells 2014-07-05 12:47:10 -07:00
  • 29d170631a more api updates mwells 2014-07-05 12:36:01 -07:00
  • cfd36af394 more api updates mwells 2014-07-05 10:16:21 -07:00
  • ea2650292a more api updates. will also be useful for running qa tests. mwells 2014-07-04 20:57:42 -07:00
  • 5a12fa8582 nothing mwells 2014-07-04 17:18:21 -07:00
  • 5bae438169 get 'gb qa' working somewhat. need to have more quick and robust smoketesting. mwells 2014-07-04 17:15:22 -07:00
  • 10bf6c3d35 fix bug in summary display mwells 2014-07-04 15:42:02 -07:00
  • 94d1b4e90c support og:image images. allow user to enter thumbnail max width/height. fix summary printing. was off a little. mwells 2014-07-04 15:33:27 -07:00
  • 641252d052 parm doc updates. mwells 2014-07-04 11:46:25 -07:00
  • 42552ea70a added GET/POST api note mwells 2014-07-04 11:28:40 -07:00
  • 0f3556206b fix default summary m_displayLen bug mwells 2014-07-04 10:55:46 -07:00
  • f9ae433ac1 fix addurl functionality on root page. mwells 2014-07-04 10:43:04 -07:00
  • 79f19d3aa3 addurl interface updates mwells 2014-07-04 10:08:32 -07:00
  • 19a8cd3967 fix statsdb/graph page mwells 2014-07-04 09:53:42 -07:00
  • 6a665430d3 fix federated search. like &c=main+test mwells 2014-07-04 09:31:07 -07:00
  • 3f8bbed3ac added note mwells 2014-07-04 09:22:09 -07:00
  • 888db7787d make patterns in the site list that start with http: or https: behave like the regex ^http: so it requires the start of the url match exactly. fixes pattern "http://xyz.com/". mwells 2014-07-04 09:18:46 -07:00
  • 53a7148634 keep thumbnail gen msgs in the log file mwells 2014-07-04 08:34:42 -07:00
  • dff47bf5cd remove spam checker to make debugging easier Matt Wells 2014-07-03 13:17:28 -07:00
  • d2996bad3a ease up on max redirects limit. was too low. Matt Wells 2014-07-03 13:09:09 -07:00
  • 00e111a182 make spider status msgs clickable to see the urls with that status. Matt Wells 2014-07-03 12:52:44 -07:00
  • c5815829e5 Merge branch 'testing' into diffbot-testing Matt Wells 2014-07-03 12:31:13 -07:00
  • 8ebda5ca51 little comment update Matt Wells 2014-07-03 12:26:02 -07:00
  • 886063a3bd fixes for query reindex. Matt Wells 2014-07-03 12:24:14 -07:00
  • 1586c2dcd7 minor parm change to back what it was mwells 2014-07-03 07:57:08 -07:00
  • 6153cd8ee2 double slash cleanup mwells 2014-07-03 07:39:02 -07:00
  • a327f9ceb0 Merge branch 'diffbot-testing' into testing mwells 2014-07-03 07:30:39 -07:00
  • b0caf3eb00 get summary "ns" parm and collectionrec knobs for summary gen working. mwells 2014-07-03 07:29:44 -07:00
  • 0701411bb1 fix page reindex bugs. Matt Wells 2014-07-02 17:13:37 -07:00
  • 781b26b820 fix so query reindex does not delete the collection. Matt Wells 2014-07-02 16:03:16 -07:00
  • b3b743d111 timezone fix for atotime1() et al Matt Wells 2014-07-02 14:06:43 -07:00
  • 2db8edc527 fix core when doing federated search while streaming. Matt Wells 2014-07-02 12:51:36 -07:00
  • 1361e5728c show actual diffbot error in urls.csv. do not stop indexing page and harvesting links on diffbot error. Matt Wells 2014-07-02 11:53:24 -07:00
  • 5e8c2e4800 fix core from cr being null for page root and not letting searchinput set itself to defaults. Matt Wells 2014-07-02 10:46:58 -07:00
  • af014abdcd title max len fixes. mwells 2014-07-02 08:03:33 -07:00
  • aa331eb880 fix core from null nsr. Matt Wells 2014-07-01 21:12:18 -07:00
  • a699432a99 Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing Matt Wells 2014-07-01 17:41:49 -07:00
  • 9f9b33edc9 try to hack fix core when streaming html results back. Matt Wells 2014-07-01 17:18:02 -07:00
  • 2b321efb1e debug log msgs mwells 2014-07-01 16:57:25 -07:00
  • 2ddd7d7366 finally got http tunnel logic working. mwells 2014-07-01 16:28:15 -06:00
  • 2f8b0694fd more http tunnel fixes mwells 2014-07-01 15:43:20 -06:00
  • 5de927f385 some fixes for http proxy tunnel mwells 2014-07-01 15:18:18 -06:00
  • fe8d41a3c3 Merge branch 'diffbot-testing' into diffbot-matt mwells 2014-07-01 14:18:54 -06:00
  • 78c3dab6dc fix hanging when doing &stream=1 for a federated search. hack it so it thinks we got m_msg3a.m_numDocIds summaries after we've printed what was requested. so we don't waste time getting all the summaries. this popped up for federated search because we merged the msg3as into a single msg3a and had an overabundance of docids. Matt Wells 2014-07-01 12:12:01 -07:00
  • d93d44250a fix debug print statements Matt Wells 2014-07-01 11:46:01 -07:00
  • ea2a125a81 Merge branch 'diffbot-testing' into diffbot-matt mwells 2014-07-01 11:46:30 -06:00
  • 69dfd60bf3 Merge branch 'testing' into diffbot-testing mwells 2014-07-01 11:43:22 -06:00
  • 20e0f0eca5 fix buggy title:schmuck OR gbmin:offerPrice query. Matt Wells 2014-07-01 10:15:42 -07:00