Commit Graph

144 Commits

Author SHA1 Message Date
4a6e93fc78 Add configurable xpath for summary 2016-09-01 13:43:07 +02:00
39c47bd469 Fix wrongly committed code 2016-08-11 13:57:44 +02:00
0ac52d0cb9 Code style changes 2016-08-11 12:58:51 +02:00
4f7cdc3a42 Fix compilation error in unit test 2016-08-08 15:43:36 +02:00
24657cd424 Fix memleak on BigFileTest 2016-07-11 13:56:50 +02:00
4d01557b00 Fix bug when handling extra long url (url longer than MAX_URL_LEN) 2016-07-05 12:19:33 +02:00
ac790137a8 Disable log trace for unit test 2016-07-04 16:16:49 +02:00
c5cc33ab42 Check for file existence before renaming to avoid removing files 2016-07-04 16:16:49 +02:00
51b74f69ff Add UrlParserTest for scheme & relative scheme 2016-06-29 11:36:34 +02:00
a1dacc4070 histogram statistis on queries 2016-06-13 16:54:56 +02:00
a49e7db197 Fix system test for summary with ellipsis 2016-06-10 14:36:58 +02:00
796332a9bb Change sample document to simpler example 2016-06-09 18:11:01 +02:00
f83ae887a4 Fix summary generation for default summary 2016-06-09 17:06:29 +02:00
17d5004f0e Add todo notes 2016-06-09 14:52:18 +02:00
d4ceb566c5 Add some test for Url::GetSubPathLen 2016-06-02 13:07:23 +02:00
d8ece42394 Add/change todo comments 2016-06-01 11:10:06 +02:00
e4d192eab6 Don't pass NULL into strncmp 2016-05-31 12:42:25 +02:00
5d24124a78 Fix UrlParser logic to be more similar to Url 2016-05-31 12:04:33 +02:00
6a63a2ce89 Add criteria for stripping session parameter. Add missing tests. 2016-05-30 12:49:43 +02:00
e8fb2d0987 Move around UrlTest so that it's similar to the structure in Url.cpp
Add UrlComponent::Validator test
2016-05-30 12:49:43 +02:00
74de317507 Initial set of test to cover UrlComponent normalization and UrlComponent::Matcher 2016-05-27 12:24:47 +02:00
c989f846eb Add IDN test for UrlParser 2016-05-27 12:23:36 +02:00
016be95366 Fix bug in UrlComponent where wrong range of hex was checked.
Invalid % encoding should encode % instead of deleting it
2016-05-27 12:21:46 +02:00
9bbf010487 Add more test for apache sort & oscommerce parameters 2016-05-26 11:26:00 +02:00
364faee241 Fix core dump when ip address is used in url 2016-05-25 17:55:29 +02:00
59d431c3b1 Simplify code for removing apache dir sort parameters 2016-05-25 17:26:34 +02:00
c08485008a Add test for url with user info 2016-05-25 16:14:06 +02:00
544b004f14 Remove affiliate links 2016-05-25 15:49:09 +02:00
afa2ce6208 Extend UrlParser to detect domain 2016-05-24 17:06:15 +02:00
23f82c0dcf Add another unit test to show bug in ellipsis 2016-05-24 16:08:31 +02:00
fc88be4c79 More Url test using tuple 2016-05-24 15:48:09 +02:00
acd31d066f Use $(MAKE) instead of make 2016-05-24 15:48:09 +02:00
6716f4c9dc Change UrlTest from using 2 char array to tuple 2016-05-24 15:48:09 +02:00
8362d880f7 Add unit test to demonstrate behaviour where ellipsis is not added.
Yet another place to use horizontal ellipsis instead of three dots
2016-05-24 13:08:35 +02:00
3bf9f64109 Fix valgrind invalid read error on Robots parsing 2016-05-24 12:06:43 +02:00
8f2843f7ba Add config=coverage to enable test coverage analysis 2016-05-23 17:32:18 +02:00
205edffd4b Ignore exit code of test target in Makefile 2016-05-19 11:06:38 +02:00
567471478a Simplify UrlTest 2016-05-18 12:55:44 +02:00
b25e30e128 Partially fix broken unit test from ellipsis changes 2016-05-13 11:31:28 +02:00
9b2d725e22 Add log warn for error log messages. Remove unused variables. 2016-05-10 16:15:34 +02:00
1d84d20a2d Ignore more clang++ warnings. Don't link with iconv64 for unit test 2016-05-10 16:15:34 +02:00
9fdfd046a1 Rename strnstr2 to strnstr 2016-05-10 16:15:32 +02:00
d77f7c69ee Add a more restrictive check for who query parameter 2016-05-10 16:15:31 +02:00
2cb2f4e877 Normalize url. Don't encode character which are not suppose to be encoded. 2016-05-10 16:15:31 +02:00
bf57e59590 Use queries map when we're not matching partial/case sensitive.
Remove more tracking params.
Combine strip session id  & tracking param into one setting. Doesn't make sense to have 2 separate setting.
2016-05-10 16:15:31 +02:00
b24efd2b98 Add more unit test & cater for more session id types 2016-05-10 16:15:31 +02:00
940b74f2e2 Split out path & path param since we can have a much more relaxed criteria for path param. 2016-05-10 16:15:31 +02:00
32120e2ef8 Separate strip tracking param unit test into smaller chunks 2016-05-10 16:15:31 +02:00
52ec5e755e Initial commit of UrlParser implementation.
Separated TitleRec version from Titledb.h
2016-05-10 16:15:31 +02:00
3e12649824 Whitespace changes in comments 2016-05-10 16:15:31 +02:00