3c2773a928
Removed gbstrlen()
...
gbstrlen() just checked for NULL and called gbshutdownAbort(). Dereferncing NULL on mordern platforms cases a SIGSEGV which is cought by our signal handler and .... gbshutdownAbort() is called. So gbstrlen() was superfluous and complicated static analysis.
2016-07-28 17:04:35 +02:00
63b4b38816
Fix addMatches segfault
2016-06-28 11:48:30 +02:00
f3f5eefcb6
First batch of changes streamlining emergency shutdown code
2016-06-20 12:30:26 +02:00
b5da0d29ec
Fix all caps PDF title
2016-06-09 14:52:18 +02:00
ad7bb591af
Re-added private access specifier in Words class
...
Caused quite a lot of changes where other code hand its dirty hands in the
innards of Words. Added necessary accessor methods, and used the opportunity to
add const if possible.
2016-05-23 16:41:14 +02:00
9b410e5e5b
bugfix 64/32 bit bug introduced in commit 96b8197ad3
2016-05-23 15:41:19 +02:00
d40ecb2f8e
Replace INT32/INT64 and likes with PRId32 and likes. Add space before definition.
2016-05-20 09:18:32 +02:00
9fdfd046a1
Rename strnstr2 to strnstr
2016-05-10 16:15:32 +02:00
bf57e59590
Use queries map when we're not matching partial/case sensitive.
...
Remove more tracking params.
Combine strip session id & tracking param into one setting. Doesn't make sense to have 2 separate setting.
2016-05-10 16:15:31 +02:00
09c15a8f88
More Url::set changes. Add more overloads for all false parameters.
2016-04-05 23:21:51 +02:00
34a4d08cd1
Add Url::set overload for all false boolean paramater
2016-04-05 23:21:50 +02:00
5ccfde3e2a
Remove titleRecVersion from Url.h/Url.cpp. Checks there are versions from before gigablast was open-sourced
2016-04-05 23:21:50 +02:00
8c118f4976
valgrind bugfix correct test in Title.cpp
2016-03-08 11:59:25 +01:00
6a49ef71df
XmlDoc::*title* bugfixes
...
The title buffers are all arrays so &array won't work to return a char**.
Removed bug-introducing static variable with bogus comment.
2016-03-08 11:10:50 +01:00
628744f25a
bugfix buffer overrun
2016-03-07 17:38:42 +01:00
2ae1412a24
Fix log message in Title
2016-02-26 10:10:35 +01:00
b85e0c3062
Split Title::setTitle and Title::setTitleFromTags so we only initialize Words when we need to
2016-02-25 13:45:34 +01:00
e384f531e4
Decouple Title from XmlDoc for unit test
2016-02-18 14:55:34 +01:00
feb55fcbe6
Remove noop operation XmlDoc::getLinkInfo2 (always return NULL)
2016-02-16 16:56:32 +01:00
439b855136
Trying to fix summary best window by playing around with score. Hopefully this will give a better summary for wikipedia and some other sites
2016-01-20 15:57:11 +01:00
ac8249e07d
Improve title/summary for youtube
2016-01-20 13:32:13 +01:00
40af20df8d
Tweak min title length
2016-01-20 10:35:01 +01:00
80488f1444
Fix title logging
2016-01-18 17:15:45 +01:00
e0fac73695
Got rid of default parameter values in Url.cpp set functions. Added support for removal of common tracking parameters in URLs.
2016-01-17 22:49:02 +01:00
c0b519bfdb
We should only use meta tags for HTML documents
2016-01-15 16:25:08 +01:00
bed3182988
Use meta tags (og:title & title) & title tag when available for generating title
2016-01-15 15:53:14 +01:00
63d2ba1a76
Remove adding of offerPrice to title for json documents.
...
Don't use title from A tag when it contains "share" in it.
2016-01-13 11:52:10 +01:00
2c14f659e4
Remove similar/unused Words::set methods
2016-01-12 11:46:28 +01:00
3cc7a032e5
Remove unused/'constant' variable in Matches::addMatches. Modify int16_tcuts & similar to shortcuts & similar.
2016-01-04 16:00:39 +01:00
691ef745d9
Fix PDF file name for when url has query string
2015-12-04 13:19:31 +01:00
72d7a0a3f4
Remove unwanted utf-8 symbols from title
2015-12-02 14:05:42 +01:00
0884edf08e
Fix title for PDF files & add some simple tests for it
2015-12-01 12:38:51 +01:00
80f7c4017f
Remove Words::m_version
...
Removed m_version memeber which was write-only. Removed version parameter to
set() methods.
2015-11-30 16:24:58 +01:00
2a061dcafd
Remove unused code RequestTable. Remove commented out code. Remove unused methods.
2015-11-26 15:31:55 +01:00
016def61a4
Remove m_htmlEncoded from Title. It's set but not used
2015-11-26 14:19:59 +01:00
e26454c73a
Minor code cleanup for Title
2015-11-26 11:57:21 +01:00
d53a4eb811
Remove commented out code, unused variable, general cleanup
2015-11-25 16:51:27 +01:00
2413a9b9b1
query stop words now based on selected langid.
2015-03-08 15:16:24 -07:00
87285ba3cd
use gbmemcpy not memcpy so we can get profiler working again
...
since memcpy can't be interrupted and backtrace() called.
2015-01-13 12:25:42 -07:00
4e8a42e024
text replacements for bad int32_t substitutions
2014-11-17 18:24:38 -08:00
931a1c4bc6
good checkpoint. quite a few fixes.
2014-11-17 18:13:36 -08:00
96b8197ad3
now it compiles with -m32
2014-11-10 14:45:11 -08:00
e7dd8f7956
replace long long with int64_t
2014-10-30 13:36:39 -06:00
6e345227a8
qa test fixes
2014-07-15 10:06:33 -07:00
af014abdcd
title max len fixes.
2014-07-02 08:03:33 -07:00
a2c6527ada
put thumbnail in proper proportion.
...
other formatting fixes.
2014-04-28 14:14:18 -07:00
9c8410767d
fix critical title alloc/free bug
...
in title.cpp.
2014-03-28 08:01:01 -07:00
c596d38e60
fix core from getting title of json object
2014-02-28 08:18:09 -08:00
82494baa89
move CollectionRec stuff into Collectiondb files
...
for simplicity.
2013-12-10 15:28:04 -08:00
e0a15194e1
fix json double decoding issue. no more
...
partial decodes, json parser stores
fully decoded string into separate buf.
2013-11-22 14:16:14 -08:00