Commit Graph

12 Commits

Author SHA1 Message Date
6d7e14d2a4 Corrects compiler warning: C++11 requires a space between string literal and macro 2021-06-19 17:17:06 +00:00
b8e4046d61 remove unnecessary line 2015-09-13 17:54:38 -07:00
65613feb4c fix bug of not using part files when generating map 2015-09-13 17:52:40 -07:00
fd6875b94c make warc reading use a thread in xmldoc.cpp 2015-09-12 11:42:27 -06:00
ad68738b1d fix compiler warnings 2015-09-10 13:24:59 -06:00
09de59f026 do not store cblock, etc. tags into tagdb to save
disk space. added tagdb file cache for better performance,
less disk accesses. will help reduce disk load.
put file cache sizes in master controls and if they change
then update the cache size dynamically.
2015-09-10 12:46:00 -06:00
a1ed368d82 bring back max mem control into master controls.
it's useful to limit per process mem usage to prevent
oom killer because we can't save if we get killed.
overhaul diskpagecache to just use rdbcache. much simpler
and faster, but disabled for now until debugged more.
reduce min files to merge for crawlbot collections so
they stay more tightly merged to conserve fds and mem.
improved logDebugDisk msgs.
overhauled File.cpp fd pool. now it is way faster and
doesn't use any extra mem. much simpler too. although
could be sped up a little by using a linked list, but
probably is not significant enough to warrant doing right now.
increase mem ptr table from 3M to 8M slots. should really make
dynamic though. fix core from null msg20s[0]->m_r.
only call attemptMergeAll once every 60 seconds really.
do not attempt merge if already merging.
2015-08-14 12:58:54 -06:00
adcef39376 Merge branch 'diffbot-testing' into diffbot-matt
Conflicts:
	Collectiondb.cpp
	Collectiondb.h
	Conf.cpp
	Conf.h
	Msg39.cpp
	PageEvents.cpp
	PageResults.cpp
	PageTurk.cpp
	Pages.cpp
	Parms.cpp
	Posdb.cpp
	Proxy.cpp
	Query.cpp
	Query.h
	RdbBase.cpp
	RdbMap.cpp
	Repair.cpp
	Repair.h
	SafeBuf.cpp
	Spider.cpp
	Tagdb.cpp
	TopTree.cpp
	XmlDoc.cpp
	main.cpp
2014-11-20 16:53:07 -08:00
bf2013345d fix up diskpagecache. how did it work before
without storing the vfd? because linked list was
over many different vfds, but the map from one diskpage
to a mem offset was specific to each vfd.
2014-11-20 15:05:34 -08:00
96b8197ad3 now it compiles with -m32 2014-11-10 14:45:11 -08:00
e7dd8f7956 replace long long with int64_t 2014-10-30 13:36:39 -06:00
f6e560c1f4 Initial file population. 2013-08-02 13:12:24 -07:00