Commit Graph

  • 3cbf38be61 Merge pull request #19 from miketung/diffbot-testing Gigablast 2014-09-09 16:36:30 -07:00
  • 0998a4f858 When printing facets in format==FORMAT_JSON: facets can have multiple values. Needs to be a JSON array. Mike Tung 2014-09-09 15:12:33 -07:00
  • dc140e4224 fix /admin/security api mwells 2014-09-09 07:11:41 -07:00
  • da63cb1f17 more website updates for gigabot guide mwells 2014-09-06 21:34:36 -07:00
  • f4e8a8260b update faq.html mwells 2014-09-06 21:17:58 -07:00
  • 8919e95ced turn on images by default now. mwells 2014-09-06 21:13:38 -07:00
  • f21f2a282a gigabot advice updates mwells 2014-09-06 21:05:11 -07:00
  • 7c5ba792d8 added spider log dropdown to search filters bar mwells 2014-09-06 13:05:18 -07:00
  • e41b6109b9 Merge branch 'diffbot-testing' into testing mwells 2014-09-06 12:17:43 -07:00
  • caf8d46b43 index spider status msg for force deletes mwells 2014-09-06 12:15:58 -07:00
  • da4f6782fb syn fix for 'sports' when lang is unknown. we default to english rather than have no synonyms, since it is in french as well. mwells 2014-09-06 10:49:22 -07:00
  • b30bc6cb20 fixed query reindex for query delete. added qa test. mwells 2014-09-06 10:38:32 -07:00
  • 44f00da70e facet fixes mwells 2014-09-05 10:21:02 -07:00
  • 1f61cb0d05 facet fixes mwells 2014-09-05 07:19:22 -07:00
  • 01bf306890 fix json facet output. mwells 2014-09-04 22:44:44 -07:00
  • 79ec8bd268 updates help/syntax page mwells 2014-09-04 22:36:28 -07:00
  • 83f165e1b3 fix facets some more. was not computing over entire set of docids, just the winners. mwells 2014-09-04 22:32:58 -07:00
  • 8dcb9363f9 more facet fixes mwells 2014-09-04 21:07:34 -07:00
  • efc7f3c064 support facet ranges now like gbfacetint:gblang,0-1,1-2,2-3,3-4,5-6,6-18 mwells 2014-09-04 20:41:37 -07:00
  • ccaad9d5da first stab at facet range buckets for histograms. mwells 2014-09-04 14:55:03 -07:00
  • df21306ffe Merge branch 'master' into testing mwells 2014-09-04 11:14:10 -07:00
  • f8487e7132 users.html update mwells 2014-09-04 11:13:32 -07:00
  • f9e5869aed Merge branch 'master' into testing mwells 2014-09-04 11:13:21 -07:00
  • 74385939c6 put dropped requests in bold red mwells 2014-09-04 11:01:49 -07:00
  • 3623a49d6a we no longer use write callbacks they seem pointless. mwells 2014-09-04 10:56:27 -07:00
  • 25669a94f7 Revert "Revert "comment out some webstuff"" mwells 2014-09-04 10:49:13 -07:00
  • 1a3f1f6d09 Revert "Revert "oops forgot loop.h"" mwells 2014-09-04 10:48:48 -07:00
  • be435d83bb Revert "oops forgot loop.h" mwells 2014-09-04 10:47:40 -07:00
  • e392798d3f Revert "comment out some webstuff" mwells 2014-09-04 10:47:31 -07:00
  • e4dc7c6d32 comment out some webstuff mwells 2014-09-04 10:44:25 -07:00
  • 8be64f69e4 Merge branch 'testing' mwells 2014-09-04 10:43:08 -07:00
  • a511274e47 udpserver slot overload fixes mwells 2014-09-04 10:41:57 -07:00
  • 0aaea60b6b oops forgot loop.h mwells 2014-09-04 10:35:07 -07:00
  • ea30e8d9b2 try to fix request rate limiting jam up in udpserver.cpp. pretty up Loop.cpp some more which now uses select(). mwells 2014-09-04 10:28:40 -07:00
  • ed66bf57b7 git ride of select on writefds. pretty pointless unless we tried to write to a socket before and the buffer was full so the write failed. then we'd want to know if it was ready for writing again i guess.... anyway i'm not so sure that happens a lot so i took it out and i guess we'll see what happens. also added more udp/loop debugging statements. Matt Wells 2014-09-03 21:24:51 -07:00
  • 0bcecfb0ce Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing Matt Wells 2014-09-03 20:03:38 -07:00
  • 217fb1f1e9 Merge branch 'testing' into diffbot-testing Matt Wells 2014-09-03 20:00:04 -07:00
  • 9e1332d34d Merge branch 'testing' into diffbot-testing Matt Wells 2014-09-03 11:32:47 -07:00
  • 4bf7826acf limit pdftohtml to 30s mwells 2014-09-03 11:30:54 -07:00
  • 21c32be2b2 try to tune query performance with higher max outstanding thread defaults. mwells 2014-09-03 11:08:01 -07:00
  • 7e3215d176 verified SIGCHLD being sent when thread completes mwells 2014-09-03 11:05:15 -07:00
  • c1a8053b52 compiler warning abatement mwells 2014-09-03 10:55:29 -07:00
  • 7bd3b42477 comment out unused code. make thread cleanups basically untimed. mwells 2014-09-03 09:48:43 -07:00
  • 55a1a4626f more signal count stats mwells 2014-09-03 09:18:30 -07:00
  • bb76ffa8b2 do not bother doing fcntl(F_SETSIG,fd) since we use select() now. however we still do get thread-cleanup signals from a thread when it is done. mwells 2014-09-03 09:02:57 -07:00
  • cd97944d1e fix cpu usage reporting mwells 2014-09-03 08:38:43 -07:00
  • 959187893e default parm val updates for easier max spiders setting mwells 2014-09-03 07:48:07 -07:00
  • 5931fb5ff1 raised MAX_SPIDERS from 100 to 300. watch out for oom though. mwells 2014-09-03 07:26:17 -07:00
  • 5a508cad69 upped MAX_SPIDERS from 100 to 300. watch out for oom though. mwells 2014-09-03 07:25:40 -07:00
  • 7866ff1196 added some files mwells 2014-09-03 06:40:04 -07:00
  • d9462e740b handle threads on EINTR from select() in Loop.cpp. mwells 2014-09-03 06:39:19 -07:00
  • 82edb9814f Merge branch 'master' into testing mwells 2014-09-03 06:10:03 -07:00
  • 19feb94f29 imported new Loop.cpp using select() from testing branch so ppl can use it on ubuntu 14.04 mwells 2014-09-02 22:22:39 -07:00
  • 71e4574c71 added status file type to search filter bar mwells 2014-09-02 22:17:27 -07:00
  • 048aa60fd9 try polling fds now with select(). real-time signals were somewhat of an artifact from bygone days and the select() call when done right seems to not use any more cpu. plus it should work well with apple os x and cygwin etc. mwells 2014-09-02 22:05:15 -07:00
  • 8f92a6d767 fix for diffbot. use objects not results for global-index. Matt Wells 2014-09-02 15:02:12 -07:00
  • 2c81157031 fix search filters bar not coming up mwells 2014-09-02 14:07:04 -07:00
  • c82c41540c if crawlbot, do not overwrite sitenuminlinks stuff in tagdb, because we imported that from big se. Matt Wells 2014-09-02 13:45:58 -07:00
  • c392e6eade fix minserpscore parm. support TYPE_DOUBLE parms. mwells 2014-09-01 20:37:17 -07:00
  • 91ab558eb5 fix widget from initializing with serps because 'cd' was a text node, "No Results Found..." and did not have the getAttribute() function available thus causing a javascript error. mwells 2014-09-01 19:24:04 -07:00
  • 65e533bbb7 website updates mwells 2014-09-01 17:23:15 -07:00
  • 58d8861a34 widget page updates mwells 2014-09-01 17:04:08 -07:00
  • 48a3239276 spider status widget fixes mwells 2014-09-01 14:22:03 -07:00
  • 25b79684c5 website gui fixes mwells 2014-09-01 13:31:13 -07:00
  • 2ccc4626dc more support for cloud initiative mwells 2014-08-31 21:55:27 -07:00
  • 7f622bd416 fixes for cloud support. mwells 2014-08-31 16:23:11 -07:00
  • 0aca84a5b2 website updates mwells 2014-08-31 15:35:53 -07:00
  • 1bc5fecb33 website updates mwells 2014-08-31 11:11:12 -07:00
  • ef8cb47590 website updates. mwells 2014-08-31 10:51:37 -07:00
  • 0d4baf228e fix display of dmoz pages mwells 2014-08-31 10:11:16 -07:00
  • 754d5b4755 rename admin.html to faq.html etc. file juggling. mwells 2014-08-31 09:51:21 -07:00
  • 6a5ad8a3e4 bio update mwells 2014-08-30 22:41:44 -07:00
  • c68874240a gigabot advice blurb fixes mwells 2014-08-30 22:26:12 -07:00
  • 253ed05074 gigabot advice blurb. mwells 2014-08-30 22:14:53 -07:00
  • 61f919a19c website updates mwells 2014-08-30 21:25:14 -07:00
  • aafacf5207 website updates mwells 2014-08-30 20:49:11 -07:00
  • acd1672277 get crawlinfo every 3 seonds not 5. mwells 2014-08-29 16:18:46 -07:00
  • 2fdea42e78 lower SPIDER_DONE_TIMER so we can be more zippy to complete jobs and also so we can pass smoke tests for crawlCreationCompletionTime etc. mwells 2014-08-29 13:50:25 -07:00
  • 9d6437c2f8 remove html column from data csv output of json objects. mwells 2014-08-29 11:58:30 -07:00
  • 9de4e4bf3d Merge branch 'testing' into diffbot-testing mwells 2014-08-29 11:23:13 -07:00
  • aa966043ad added quickpolls. mwells 2014-08-28 19:45:25 -07:00
  • fb161e0102 more minor bug fixes. mwells 2014-08-28 18:11:07 -07:00
  • 4dbc6ed745 minor bug fix mwells 2014-08-28 18:09:23 -07:00
  • 060e887f08 misc/various bug fixes. fix canonical redir url bug with iframes. mwells 2014-08-28 18:07:22 -07:00
  • 3457245893 fix printf compiler warnings mwells 2014-08-28 13:23:46 -07:00
  • caee238c46 fixes to make easier to compile on max os x. mwells 2014-08-28 12:55:02 -07:00
  • 94e8b43cc9 fix bug of not running df -ka to get disk usage mwells 2014-08-28 09:49:47 -07:00
  • c641666b45 Merge branch 'master' of github.com:gigablast/open-source-search-engine mwells 2014-08-28 08:52:48 -06:00
  • adcdf672bd Merge branch 'testing' mwells 2014-08-28 07:46:35 -07:00
  • 38cef7d52e fix # docs and recs bug. mwells 2014-08-28 07:45:43 -07:00
  • e7aa933959 awesome updates to help.html page mwells 2014-08-27 22:21:30 -07:00
  • 51baca8917 Merge branch 'testing' of github.com:gigablast/open-source-search-engine into testing mwells 2014-08-27 23:11:18 -06:00
  • ad8168f214 updates for query help table mwells 2014-08-27 23:10:27 -06:00
  • 8f6b82261f makefile updates mwells 2014-08-27 22:09:21 -06:00
  • f7af7ea2af re-enable support for canonical url "redirects". mwells 2014-08-27 19:28:48 -07:00
  • 2a34e6b2b8 Merge branch 'testing' mwells 2014-08-27 19:25:16 -07:00
  • c4a0967b12 remove copyright all rights reserved from serps if custom html tail provided. mwells 2014-08-27 19:24:40 -07:00
  • 3c980fe592 Merge branch 'testing' mwells 2014-08-27 20:00:49 -06:00
  • 203754b78a makefile updates mwells 2014-08-27 19:59:57 -06:00
  • 52e601e27a Merge branch 'testing' of git@github.com:gigablast/open-source-search-engine into testing Matt Wells 2014-08-27 18:30:01 -07:00