55 Commits

Author SHA1 Message Date
3c2773a928 Removed gbstrlen()
gbstrlen() just checked for NULL and called gbshutdownAbort(). Dereferencing NULL on modern platforms causes a SIGSEGV which is caught by our signal handler and .... gbshutdownAbort() is called. So gbstrlen() was superfluous and complicated static analysis.
2016-07-28 17:04:35 +02:00
63b4b38816 Fix addMatches segfault 2016-06-28 11:48:30 +02:00
f3f5eefcb6 First batch of changes streamlining emergency shutdown code 2016-06-20 12:30:26 +02:00
b5da0d29ec Fix all caps PDF title 2016-06-09 14:52:18 +02:00
ad7bb591af Re-added private access specifier in Words class
Caused quite a lot of changes where other code had its dirty hands in the
innards of Words. Added necessary accessor methods, and used the opportunity to
add const where possible.
2016-05-23 16:41:14 +02:00
9b410e5e5b bugfix 64/32 bit bug introduced in commit 96b8197ad3 2016-05-23 15:41:19 +02:00
d40ecb2f8e Replace INT32/INT64 and likes with PRId32 and likes. Add space before definition. 2016-05-20 09:18:32 +02:00
9fdfd046a1 Rename strnstr2 to strnstr 2016-05-10 16:15:32 +02:00
bf57e59590 Use queries map when we're not matching partial/case sensitive.
Remove more tracking params.
Combine strip session id & tracking param into one setting. Doesn't make sense to have 2 separate settings.
2016-05-10 16:15:31 +02:00
09c15a8f88 More Url::set changes. Add more overloads for all false parameters. 2016-04-05 23:21:51 +02:00
34a4d08cd1 Add Url::set overload for all false boolean parameters 2016-04-05 23:21:50 +02:00
5ccfde3e2a Remove titleRecVersion from Url.h/Url.cpp. Checks there are versions from before gigablast was open-sourced 2016-04-05 23:21:50 +02:00
8c118f4976 valgrind bugfix correct test in Title.cpp 2016-03-08 11:59:25 +01:00
6a49ef71df XmlDoc::*title* bugfixes
The title buffers are all arrays so &array won't work to return a char**.
Removed bug-introducing static variable with bogus comment.
2016-03-08 11:10:50 +01:00
628744f25a bugfix buffer overrun 2016-03-07 17:38:42 +01:00
2ae1412a24 Fix log message in Title 2016-02-26 10:10:35 +01:00
b85e0c3062 Split Title::setTitle and Title::setTitleFromTags so we only initialize Words when we need to 2016-02-25 13:45:34 +01:00
e384f531e4 Decouple Title from XmlDoc for unit test 2016-02-18 14:55:34 +01:00
feb55fcbe6 Remove noop operation XmlDoc::getLinkInfo2 (always return NULL) 2016-02-16 16:56:32 +01:00
439b855136 Trying to fix summary best window by playing around with score. Hopefully this will give a better summary for wikipedia and some other sites 2016-01-20 15:57:11 +01:00
ac8249e07d Improve title/summary for youtube 2016-01-20 13:32:13 +01:00
40af20df8d Tweak min title length 2016-01-20 10:35:01 +01:00
80488f1444 Fix title logging 2016-01-18 17:15:45 +01:00
e0fac73695 Got rid of default parameter values in Url.cpp set functions. Added support for removal of common tracking parameters in URLs. 2016-01-17 22:49:02 +01:00
c0b519bfdb We should only use meta tags for HTML documents 2016-01-15 16:25:08 +01:00
bed3182988 Use meta tags (og:title & title) & title tag when available for generating title 2016-01-15 15:53:14 +01:00
63d2ba1a76 Remove adding of offerPrice to title for json documents.
Don't use title from A tag when it contains "share" in it.
2016-01-13 11:52:10 +01:00
2c14f659e4 Remove similar/unused Words::set methods 2016-01-12 11:46:28 +01:00
3cc7a032e5 Remove unused/'constant' variable in Matches::addMatches. Modify int16_tcuts & similar to shortcuts & similar. 2016-01-04 16:00:39 +01:00
691ef745d9 Fix PDF file name for when url has query string 2015-12-04 13:19:31 +01:00
72d7a0a3f4 Remove unwanted utf-8 symbols from title 2015-12-02 14:05:42 +01:00
0884edf08e Fix title for PDF files & add some simple tests for it 2015-12-01 12:38:51 +01:00
80f7c4017f Remove Words::m_version
Removed m_version member which was write-only. Removed version parameter to
set() methods.
2015-11-30 16:24:58 +01:00
2a061dcafd Remove unused code RequestTable. Remove commented out code. Remove unused methods. 2015-11-26 15:31:55 +01:00
016def61a4 Remove m_htmlEncoded from Title. It's set but not used 2015-11-26 14:19:59 +01:00
e26454c73a Minor code cleanup for Title 2015-11-26 11:57:21 +01:00
d53a4eb811 Remove commented out code, unused variable, general cleanup 2015-11-25 16:51:27 +01:00
2413a9b9b1 query stop words now based on selected langid. 2015-03-08 15:16:24 -07:00
87285ba3cd use gbmemcpy not memcpy so we can get profiler working again
since memcpy can't be interrupted and backtrace() called.
2015-01-13 12:25:42 -07:00
4e8a42e024 text replacements for bad int32_t substitutions 2014-11-17 18:24:38 -08:00
931a1c4bc6 good checkpoint. quite a few fixes. 2014-11-17 18:13:36 -08:00
96b8197ad3 now it compiles with -m32 2014-11-10 14:45:11 -08:00
e7dd8f7956 replace long long with int64_t 2014-10-30 13:36:39 -06:00
6e345227a8 qa test fixes 2014-07-15 10:06:33 -07:00
af014abdcd title max len fixes. 2014-07-02 08:03:33 -07:00
a2c6527ada put thumbnail in proper proportion.
other formatting fixes.
2014-04-28 14:14:18 -07:00
9c8410767d fix critical title alloc/free bug
in title.cpp.
2014-03-28 08:01:01 -07:00
c596d38e60 fix core from getting title of json object 2014-02-28 08:18:09 -08:00
82494baa89 move CollectionRec stuff into Collectiondb files
for simplicity.
2013-12-10 15:28:04 -08:00
e0a15194e1 fix json double decoding issue. no more
partial decodes, json parser stores
fully decoded string into separate buf.
2013-11-22 14:16:14 -08:00
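
Note on commit 6a49ef71df ("The title buffers are all arrays so &array won't work to return a char**"): the following is a minimal sketch of the language rule behind that message, not the actual XmlDoc code. The names TitleHolder, m_titleBuf, m_titlePtr, setTitle and getTitlePtr are hypothetical and chosen only for illustration. Taking the address of a char[N] array yields a char (*)[N], so a function that must hand out a char** needs a real pointer object to point at.

```cpp
#include <cassert>
#include <cstring>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for a class that keeps its title in a fixed-size array.
struct TitleHolder {
    char  m_titleBuf[64];  // the array that holds the characters
    char *m_titlePtr;      // a real pointer object referring to the array

    TitleHolder() : m_titlePtr(m_titleBuf) { m_titleBuf[0] = '\0'; }

    // Copying would leave m_titlePtr aimed at the other object's buffer.
    TitleHolder(const TitleHolder &) = delete;
    TitleHolder &operator=(const TitleHolder &) = delete;

    // Copy at most 63 characters and always NUL-terminate.
    void setTitle(const char *s) {
        std::strncpy(m_titleBuf, s, sizeof(m_titleBuf) - 1);
        m_titleBuf[sizeof(m_titleBuf) - 1] = '\0';
    }

    // "return &m_titleBuf;" would not compile here: that expression has
    // type char (*)[64], which does not convert to char **.
    char **getTitlePtr() { return &m_titlePtr; }
};

// The address of an array is a pointer-to-array, not a pointer-to-pointer.
using Buf = char[64];
static_assert(std::is_same<decltype(&std::declval<Buf &>()), char (*)[64]>::value,
              "&array is a pointer to the whole array");
static_assert(!std::is_same<decltype(&std::declval<Buf &>()), char **>::value,
              "&array is not a char **");
```

A caller would construct a TitleHolder, call setTitle("..."), and dereference the result of getTitlePtr() to reach the buffer. Whether the real fix added a pointer member or changed the return type is not stated in the commit message.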