0998a4f858
When printing facets in format==FORMAT_JSON: facets can have multiple values. Needs to be a JSON array.
Mike Tung
2014-09-09 15:12:33 -07:00
dc140e4224
fix /admin/security api
mwells
2014-09-09 07:11:41 -07:00
da63cb1f17
more website updates for gigabot guide
mwells
2014-09-06 21:34:36 -07:00
7c5ba792d8
added spider log dropdown to search filters bar
mwells
2014-09-06 13:05:18 -07:00
e41b6109b9
Merge branch 'diffbot-testing' into testing
mwells
2014-09-06 12:17:43 -07:00
caf8d46b43
index spider status msg for force deletes
mwells
2014-09-06 12:15:58 -07:00
da4f6782fb
syn fix for 'sports' when lang is unknown. we default to english rather than have no synonyms, since it is in french as well.
mwells
2014-09-06 10:49:22 -07:00
ea30e8d9b2
try to fix request rate limiting jam up in udpserver.cpp. pretty up Loop.cpp some more which now uses select().
mwells
2014-09-04 10:28:40 -07:00
ed66bf57b7
git ride of select on writefds. pretty pointless unless we tried to write to a socket before and the buffer was full so the write failed. then we'd want to know if it was ready for writing again i guess.... anyway i'm not so sure that happens a lot so i took it out and i guess we'll see what happens. also added more udp/loop debugging statements.
Matt Wells
2014-09-03 21:24:51 -07:00
0bcecfb0ce
Merge branch 'diffbot-testing' of github.com:gigablast/open-source-search-engine into diffbot-testing
Matt Wells
2014-09-03 20:03:38 -07:00
217fb1f1e9
Merge branch 'testing' into diffbot-testing
Matt Wells
2014-09-03 20:00:04 -07:00
9e1332d34d
Merge branch 'testing' into diffbot-testing
Matt Wells
2014-09-03 11:32:47 -07:00
4bf7826acf
limit pdftohtml to 30s
mwells
2014-09-03 11:30:54 -07:00
21c32be2b2
try to tune query performance with higher max outstanding thread defaults.
mwells
2014-09-03 11:08:01 -07:00
7e3215d176
verified SIGCHLD being sent when thread completes
mwells
2014-09-03 11:05:15 -07:00
7bd3b42477
comment out unused code. make thread cleanups basically untimed.
mwells
2014-09-03 09:48:43 -07:00
55a1a4626f
more signal count stats
mwells
2014-09-03 09:18:30 -07:00
bb76ffa8b2
do not bother doing fcntl(F_SETSIG,fd) since we use select() now. however we still do get thread-cleanup signals from a thread when it is done.
mwells
2014-09-03 09:02:57 -07:00
cd97944d1e
fix cpu usage reporting
mwells
2014-09-03 08:38:43 -07:00
959187893e
default parm val updates for easier max spiders setting
mwells
2014-09-03 07:48:07 -07:00
5931fb5ff1
raised MAX_SPIDERS from 100 to 300. watch out for oom though.
mwells
2014-09-03 07:26:17 -07:00
5a508cad69
upped MAX_SPIDERS from 100 to 300. watch out for oom though.
mwells
2014-09-03 07:25:40 -07:00
7866ff1196
added some files
mwells
2014-09-03 06:40:04 -07:00
d9462e740b
handle threads on EINTR from select() in Loop.cpp.
mwells
2014-09-03 06:39:19 -07:00
82edb9814f
Merge branch 'master' into testing
mwells
2014-09-03 06:10:03 -07:00
19feb94f29
imported new Loop.cpp using select() from testing branch so ppl can use it on ubuntu 14.04
mwells
2014-09-02 22:22:39 -07:00
71e4574c71
added status file type to search filter bar
mwells
2014-09-02 22:17:27 -07:00
048aa60fd9
try polling fds now with select(). real-time signals were somewhat of an artifact from bygone days and the select() call when done right seems to not use any more cpu. plus it should work well with apple os x and cygwin etc.
mwells
2014-09-02 22:05:15 -07:00
8f92a6d767
fix for diffbot. use objects not results for global-index.
Matt Wells
2014-09-02 15:02:12 -07:00
2c81157031
fix search filters bar not coming up
mwells
2014-09-02 14:07:04 -07:00
c82c41540c
if crawlbot, do not overwrite sitenuminlinks stuff in tagdb, because we imported that from big se.
Matt Wells
2014-09-02 13:45:58 -07:00
91ab558eb5
fix widget from initializing with serps because 'cd' was a text node, "No Results Found..." and did not have the getAttribute() function available thus causing a javascript error.
mwells
2014-09-01 19:24:04 -07:00
acd1672277
get crawlinfo every 3 seonds not 5.
mwells
2014-08-29 16:18:46 -07:00
2fdea42e78
lower SPIDER_DONE_TIMER so we can be more zippy to complete jobs and also so we can pass smoke tests for crawlCreationCompletionTime etc.
mwells
2014-08-29 13:50:25 -07:00
9d6437c2f8
remove html column from data csv output of json objects.
mwells
2014-08-29 11:58:30 -07:00
9de4e4bf3d
Merge branch 'testing' into diffbot-testing
mwells
2014-08-29 11:23:13 -07:00