7c5ba792d8added spider log dropdown to search filters bar
mwells
2014-09-06 13:05:18 -07:00
e41b6109b9Merge branch 'diffbot-testing' into testing
mwells
2014-09-06 12:17:43 -07:00
caf8d46b43index spider status msg for force deletes
mwells
2014-09-06 12:15:58 -07:00
da4f6782fbsyn fix for 'sports' when lang is unknown. we default to english rather than have no synonyms, since it is in french as well.
mwells
2014-09-06 10:49:22 -07:00
ea30e8d9b2try to fix request rate limiting jam up in udpserver.cpp. pretty up Loop.cpp some more which now uses select().
mwells
2014-09-04 10:28:40 -07:00
ed66bf57b7git ride of select on writefds. pretty pointless unless we tried to write to a socket before and the buffer was full so the write failed. then we'd want to know if it was ready for writing again i guess.... anyway i'm not so sure that happens a lot so i took it out and i guess we'll see what happens. also added more udp/loop debugging statements.
Matt Wells
2014-09-03 21:24:51 -07:00
0bcecfb0ceMerge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing
Matt Wells
2014-09-03 20:03:38 -07:00
217fb1f1e9Merge branch 'testing' into diffbot-testing
Matt Wells
2014-09-03 20:00:04 -07:00
9e1332d34dMerge branch 'testing' into diffbot-testing
Matt Wells
2014-09-03 11:32:47 -07:00
4bf7826acflimit pdftohtml to 30s
mwells
2014-09-03 11:30:54 -07:00
21c32be2b2try to tune query performance with higher max outstanding thread defaults.
mwells
2014-09-03 11:08:01 -07:00
7e3215d176verified SIGCHLD being sent when thread completes
mwells
2014-09-03 11:05:15 -07:00
7bd3b42477comment out unused code. make thread cleanups basically untimed.
mwells
2014-09-03 09:48:43 -07:00
55a1a4626fmore signal count stats
mwells
2014-09-03 09:18:30 -07:00
bb76ffa8b2do not bother doing fcntl(F_SETSIG,fd) since we use select() now. however we still do get thread-cleanup signals from a thread when it is done.
mwells
2014-09-03 09:02:57 -07:00
cd97944d1efix cpu usage reporting
mwells
2014-09-03 08:38:43 -07:00
959187893edefault parm val updates for easier max spiders setting
mwells
2014-09-03 07:48:07 -07:00
5931fb5ff1raised MAX_SPIDERS from 100 to 300. watch out for oom though.
mwells
2014-09-03 07:26:17 -07:00
5a508cad69upped MAX_SPIDERS from 100 to 300. watch out for oom though.
mwells
2014-09-03 07:25:40 -07:00
7866ff1196added some files
mwells
2014-09-03 06:40:04 -07:00
d9462e740bhandle threads on EINTR from select() in Loop.cpp.
mwells
2014-09-03 06:39:19 -07:00
82edb9814fMerge branch 'master' into testing
mwells
2014-09-03 06:10:03 -07:00
19feb94f29imported new Loop.cpp using select() from testing branch so ppl can use it on ubuntu 14.04
mwells
2014-09-02 22:22:39 -07:00
71e4574c71added status file type to search filter bar
mwells
2014-09-02 22:17:27 -07:00
048aa60fd9try polling fds now with select(). real-time signals were somewhat of an artifact from bygone days and the select() call when done right seems to not use any more cpu. plus it should work well with apple os x and cygwin etc.
mwells
2014-09-02 22:05:15 -07:00
8f92a6d767fix for diffbot. use objects not results for global-index.
Matt Wells
2014-09-02 15:02:12 -07:00
2c81157031fix search filters bar not coming up
mwells
2014-09-02 14:07:04 -07:00
c82c41540cif crawlbot, do not overwrite sitenuminlinks stuff in tagdb, because we imported that from big se.
Matt Wells
2014-09-02 13:45:58 -07:00
c392e6eadefix minserpscore parm. support TYPE_DOUBLE parms.
mwells
2014-09-01 20:37:17 -07:00
91ab558eb5fix widget from initializing with serps because 'cd' was a text node, "No Results Found..." and did not have the getAttribute() function available thus causing a javascript error.
mwells
2014-09-01 19:24:04 -07:00
acd1672277get crawlinfo every 3 seonds not 5.
mwells
2014-08-29 16:18:46 -07:00
2fdea42e78lower SPIDER_DONE_TIMER so we can be more zippy to complete jobs and also so we can pass smoke tests for crawlCreationCompletionTime etc.
mwells
2014-08-29 13:50:25 -07:00
9d6437c2f8remove html column from data csv output of json objects.
mwells
2014-08-29 11:58:30 -07:00
9de4e4bf3dMerge branch 'testing' into diffbot-testing
mwells
2014-08-29 11:23:13 -07:00