Commit Graph

168 Commits

Author SHA1 Message Date
Ivan Skytte Jørgensen
beeddcf35d Got rid of gb-include.h 2018-07-26 17:29:51 +02:00
Ivan Skytte Jørgensen
50740e4a00 Merge branch 'master' into tokenizer 2018-03-26 12:59:24 +02:00
Ivan Skytte Jørgensen
229ff8e036 Renamed Query*::phrase* to ...bigram... 2018-03-26 12:58:02 +02:00
Ivan Skytte Jørgensen
3fd158c397 tokenizer: first shot at something that appears to work 2018-03-09 16:24:39 +01:00
Ai Lin Chia
9b4bb76813 Rename Query::set2 to Query::set 2018-03-02 14:09:16 +01:00
Ivan Skytte Jørgensen
bdb70c3460 Removed support for 'gbsortbyint:' and 'gbrevsortbyint:'
Thereby int32-terms 'gbspiderdate' and 'gbindexdate' are no longer needed (also removed).
2018-01-12 14:47:55 +01:00
Brian Rasmusson
93247cee3c added [nrw] keyword so you can mark query words as not required. For testing purposes. 2018-01-04 17:31:53 +01:00
Ivan Skytte Jørgensen
e33da80138 Dump more info in 'term' field in JSON output 2018-01-04 12:45:11 +01:00
Ivan Skytte Jørgensen
eae13e77de Removed unused parameter 'hasColon' from getFieldCode() 2017-12-29 12:58:00 +01:00
Ivan Skytte Jørgensen
eebe650fd1 Merge branch 'master' into sto 2017-12-11 17:07:33 +01:00
Ivan Skytte Jørgensen
aba03fe945 Removed unused QueryWord::m_level 2017-12-11 16:15:09 +01:00
Ivan Skytte Jørgensen
36b99cada4 Support [s] brackets for setting userweight on synonyms/word variations 2017-12-11 16:03:57 +01:00
Ivan Skytte Jørgensen
78425d243b Merge branch 'master' into sto 2017-12-11 15:41:57 +01:00
Ivan Skytte Jørgensen
cf9ab92e9a user-weight as floating-point
Code supported it halfway but then ignored the fractional part. Now full floating-point is supported in user weight, e.g. "[3.14159265w]someword"
2017-12-11 15:40:49 +01:00
Ivan Skytte Jørgensen
6e2fac79d5 Merge branch 'master' into sto 2017-12-08 12:56:16 +01:00
Ivan Skytte Jørgensen
8878036e63 Renamed ScoringWeights to DerivedScoringWeights 2017-12-08 12:49:43 +01:00
Ivan Skytte Jørgensen
630a3c9a76 Merge branch 'master' into sto 2017-12-04 14:53:13 +01:00
Ivan Skytte Jørgensen
1c9b89a147 Changed QueryWord::opcode to a real enumeration 2017-12-04 14:53:03 +01:00
Ivan Skytte Jørgensen
011ca6ab2c Merge branch 'master' into sto 2017-12-04 13:03:12 +01:00
Ivan Skytte Jørgensen
b77ccfa338 Removed unused range-based operators (gbmin/gbmax/gbsortbyfloat etc).
The field codes were defined but never set, and tested for in several places.
2017-12-04 12:59:05 +01:00
Ivan Skytte Jørgensen
0720bafd26 Merge branch 'master' into sto 2017-12-01 18:56:59 +01:00
Ivan Skytte Jørgensen
b47884890d Removed write-only QueryTerm::m_ks 2017-12-01 18:52:46 +01:00
Ivan Skytte Jørgensen
2520a07fe5 Removed unused QueryWord::m_hardCount and QueryTerm::m_hardCount 2017-12-01 18:42:01 +01:00
Ivan Skytte Jørgensen
603c724d9e Removed QueryWord/Term::m_implicitBits/m_matchesExplicitBits/m_explicitBit and MAX_EXPLICIT_BITS
Fields were set and some calculation leading nowhere was done. ==> unused.
2017-12-01 18:35:21 +01:00
Ivan Skytte Jørgensen
3d5318c77a word variations seem to work now 2017-11-27 16:51:50 +01:00
Ivan Skytte Jørgensen
2c64859421 Made wordvariation stuff configurable. Made wordvariation-da handle indefinite->definite variations 2017-11-24 16:43:19 +01:00
Ivan Skytte Jørgensen
670ff8d9e4 Merge branch 'master' into sto 2017-11-24 14:34:47 +01:00
Ivan Skytte Jørgensen
978a986b0d Changed default argument to Query::set2() to non-default 2017-11-24 14:27:02 +01:00
Ivan Skytte Jørgensen
36fd2c2615 Made lang-spec word variations configurable 2017-11-24 14:19:50 +01:00
Ivan Skytte Jørgensen
7bac917df4 Renamed Query::*queryExpansion to wiktionaryWordVariations 2017-11-24 13:52:11 +01:00
Ivan Skytte Jørgensen
4324191bdf Added word variation support (hardcoded test implementation for now) 2017-11-23 16:03:47 +01:00
Ivan Skytte Jørgensen
050b86f9db Made QueryWord::m_word const 2017-11-23 14:48:32 +01:00
Ivan Skytte Jørgensen
74d1a4b457 Made QueryTerm::m_term const 2017-11-23 13:59:06 +01:00
Ivan Skytte Jørgensen
cad25a18e2 Dropped unused member QueryTerm::m_numAlnumWordsInBase 2017-11-23 12:41:19 +01:00
Ivan Skytte Jørgensen
128790084b Use lang_t enum more than just plain uint8_t 2017-11-21 16:18:45 +01:00
Ivan Skytte Jørgensen
d36a6a4e39 disable site clustering when doing domain-like searches (configurable) 2017-11-14 14:50:30 +01:00
Ivan Skytte Jørgensen
3ca5da2ca9 Factored out log-qterms-to-log so we can also trace them after query::modifyQuery() 2017-11-14 12:52:41 +01:00
Ai Lin Chia
84b9fe27cc Remove gbispermalink 2017-10-06 17:46:14 +02:00
Ai Lin Chia
c69273219d Check allowHighFrequencyTermCache flag before using g_htfs 2017-10-05 15:56:18 +02:00
Ivan Skytte Jørgensen
9717298c58 Changed Query::m_...Bits from members to plain local variables 2017-08-22 15:51:32 +02:00
Ivan Skytte Jørgensen
f642f81729 Removed unused Query::m_numRequired 2017-08-22 14:39:23 +02:00
Ai Lin Chia
afffa35e56 Merge branch 'master' into dev-dns 2017-07-10 23:02:56 +02:00
Ivan Skytte Jørgensen
085f861249 Added comment about why QueryWord::m_phraseLen may be less than the length of the source phrase due to non-alphanumeric words 2017-07-07 14:43:14 +02:00
Ivan Skytte Jørgensen
22d2a83d34 Rewrite API-like queries 2017-07-07 14:19:45 +02:00
Ivan Skytte Jørgensen
b969a864e6 Made domain-like query rewrite configurable 2017-07-07 13:49:31 +02:00
Ai Lin Chia
fd8690d395 Remove some unused fields 2017-07-05 11:38:12 +02:00
Ai Lin Chia
3efde8a60d Merge branch 'master' into dev-dns 2017-07-05 11:27:56 +02:00
Ai Lin Chia
2ec678d628 Remove now unused configuration to index spider reply. Remove commented out code. Remove now unused fields. 2017-07-05 11:22:26 +02:00
Ivan Skytte Jørgensen
c3013411c5 Query rewriting: domains
Queries of the form aaa.bbb.ccc or aaa.bbb are interpreted as requiring all terms and bigrams, and matches in URLs are given a boost.
2017-07-04 15:14:53 +02:00
Ivan Skytte Jørgensen
7ba8c92496 Changed quer:* m_fieldCode from char to an enum 2017-06-29 15:21:23 +02:00