Ivan Skytte Jørgensen
|
beeddcf35d
|
Got rid of gb-include.h
|
2018-07-26 17:29:51 +02:00 |
|
Ivan Skytte Jørgensen
|
50740e4a00
|
Merge branch 'master' into tokenizer
|
2018-03-26 12:59:24 +02:00 |
|
Ivan Skytte Jørgensen
|
229ff8e036
|
Renamed Query*::phrase* to ...bigram...
|
2018-03-26 12:58:02 +02:00 |
|
Ivan Skytte Jørgensen
|
3fd158c397
|
tokenizer: first shot at something that appears to work
|
2018-03-09 16:24:39 +01:00 |
|
Ai Lin Chia
|
9b4bb76813
|
Rename Query::set2 to Query::set
|
2018-03-02 14:09:16 +01:00 |
|
Ivan Skytte Jørgensen
|
bdb70c3460
|
Removed support for 'gbsortbyint:' and 'gbrevsortbyint:'
Thereby int32-terms 'gbspiderdate' and 'gbindexdate' are no longer needed (also removed).
|
2018-01-12 14:47:55 +01:00 |
|
Brian Rasmusson
|
93247cee3c
|
added [nrw] keyword so you can mark query words as not required. For testing purposes.
|
2018-01-04 17:31:53 +01:00 |
|
Ivan Skytte Jørgensen
|
e33da80138
|
Dump more info in 'term' field in JSON output
|
2018-01-04 12:45:11 +01:00 |
|
Ivan Skytte Jørgensen
|
eae13e77de
|
Removed unused parameter 'hasColon' from getFieldCode()
|
2017-12-29 12:58:00 +01:00 |
|
Ivan Skytte Jørgensen
|
eebe650fd1
|
Merge branch 'master' into sto
|
2017-12-11 17:07:33 +01:00 |
|
Ivan Skytte Jørgensen
|
aba03fe945
|
Removed unused QueryWord::m_level
|
2017-12-11 16:15:09 +01:00 |
|
Ivan Skytte Jørgensen
|
36b99cada4
|
Support [s] brackets for setting userweight on synonyms/word variations
|
2017-12-11 16:03:57 +01:00 |
|
Ivan Skytte Jørgensen
|
78425d243b
|
Merge branch 'master' into sto
|
2017-12-11 15:41:57 +01:00 |
|
Ivan Skytte Jørgensen
|
cf9ab92e9a
|
user-weight as floating-point
Code supported it halfway but then ignored the fractional part. Now full floating-point is supported in user weight, e.g. "[3.14159265w]someword"
|
2017-12-11 15:40:49 +01:00 |
|
Ivan Skytte Jørgensen
|
6e2fac79d5
|
Merge branch 'master' into sto
|
2017-12-08 12:56:16 +01:00 |
|
Ivan Skytte Jørgensen
|
8878036e63
|
Renamed ScoringWeights to DerivedScoringWeights
|
2017-12-08 12:49:43 +01:00 |
|
Ivan Skytte Jørgensen
|
630a3c9a76
|
Merge branch 'master' into sto
|
2017-12-04 14:53:13 +01:00 |
|
Ivan Skytte Jørgensen
|
1c9b89a147
|
Changed QueryWord::opcode to a real enumeration
|
2017-12-04 14:53:03 +01:00 |
|
Ivan Skytte Jørgensen
|
011ca6ab2c
|
Merge branch 'master' into sto
|
2017-12-04 13:03:12 +01:00 |
|
Ivan Skytte Jørgensen
|
b77ccfa338
|
Removed unused range-based operators (gbmin/gbmax/gbsortbyfloat etc).
The field codes were defined but never set, and tested for in several places.
|
2017-12-04 12:59:05 +01:00 |
|
Ivan Skytte Jørgensen
|
0720bafd26
|
Merge branch 'master' into sto
|
2017-12-01 18:56:59 +01:00 |
|
Ivan Skytte Jørgensen
|
b47884890d
|
Removed write-only QueryTerm::m_ks
|
2017-12-01 18:52:46 +01:00 |
|
Ivan Skytte Jørgensen
|
2520a07fe5
|
Removed unused QueryWord::m_hardCount and QueryTerm::m_hardCount
|
2017-12-01 18:42:01 +01:00 |
|
Ivan Skytte Jørgensen
|
603c724d9e
|
Removed QueryWord/Term::m_implicitBits/m_matchesExplicitBits/m_explicitBit and MAX_EXPLICIT_BITS
Fields were set and some calculation leading nowhere was done. ==> unused.
|
2017-12-01 18:35:21 +01:00 |
|
Ivan Skytte Jørgensen
|
3d5318c77a
|
word variations seem to work now
|
2017-11-27 16:51:50 +01:00 |
|
Ivan Skytte Jørgensen
|
2c64859421
|
Made wordvariation stuff configurable. Made wordvariation-da handle indefinite->definite variations
|
2017-11-24 16:43:19 +01:00 |
|
Ivan Skytte Jørgensen
|
670ff8d9e4
|
Merge branch 'master' into sto
|
2017-11-24 14:34:47 +01:00 |
|
Ivan Skytte Jørgensen
|
978a986b0d
|
Changed default argument to Query::set2() to non-default
|
2017-11-24 14:27:02 +01:00 |
|
Ivan Skytte Jørgensen
|
36fd2c2615
|
Made lang-spec word variations configurable
|
2017-11-24 14:19:50 +01:00 |
|
Ivan Skytte Jørgensen
|
7bac917df4
|
Renamed Query::*queryExapnsion to wiktionaryWordVariations
|
2017-11-24 13:52:11 +01:00 |
|
Ivan Skytte Jørgensen
|
4324191bdf
|
Added word variation support (hardcoded test implementation for now)
|
2017-11-23 16:03:47 +01:00 |
|
Ivan Skytte Jørgensen
|
050b86f9db
|
Made QueryWord::m_word const
|
2017-11-23 14:48:32 +01:00 |
|
Ivan Skytte Jørgensen
|
74d1a4b457
|
Made QueryTerm::m_term const
|
2017-11-23 13:59:06 +01:00 |
|
Ivan Skytte Jørgensen
|
cad25a18e2
|
Dropped unused member QueryTerm::m_numAlnumWordsInBase
|
2017-11-23 12:41:19 +01:00 |
|
Ivan Skytte Jørgensen
|
128790084b
|
Use lang_t enum more than just plain uint8_t
|
2017-11-21 16:18:45 +01:00 |
|
Ivan Skytte Jørgensen
|
d36a6a4e39
|
disable site clustering when doing domain-like searches (configurable)
|
2017-11-14 14:50:30 +01:00 |
|
Ivan Skytte Jørgensen
|
3ca5da2ca9
|
Factored out log-qterms-to-log so we can also trace them after query::modifyQuery()
|
2017-11-14 12:52:41 +01:00 |
|
Ai Lin Chia
|
84b9fe27cc
|
Remove gbispermalink
|
2017-10-06 17:46:14 +02:00 |
|
Ai Lin Chia
|
c69273219d
|
Check allowHighFrequencyTermCache flag before using g_htfs
|
2017-10-05 15:56:18 +02:00 |
|
Ivan Skytte Jørgensen
|
9717298c58
|
Changed Query::m_...Bits from members to plain local variables
|
2017-08-22 15:51:32 +02:00 |
|
Ivan Skytte Jørgensen
|
f642f81729
|
Removed unused Query::m_numRequired
|
2017-08-22 14:39:23 +02:00 |
|
Ai Lin Chia
|
afffa35e56
|
Merge branch 'master' into dev-dns
|
2017-07-10 23:02:56 +02:00 |
|
Ivan Skytte Jørgensen
|
085f861249
|
Added comment about why QueryWord::m_phraseLen may be less than the length of the source phrase due to non-alphanumeric words
|
2017-07-07 14:43:14 +02:00 |
|
Ivan Skytte Jørgensen
|
22d2a83d34
|
Rewrite API-like queries
|
2017-07-07 14:19:45 +02:00 |
|
Ivan Skytte Jørgensen
|
b969a864e6
|
Made domain-like query rewrite configurable
|
2017-07-07 13:49:31 +02:00 |
|
Ai Lin Chia
|
fd8690d395
|
Remove some unused fields
|
2017-07-05 11:38:12 +02:00 |
|
Ai Lin Chia
|
3efde8a60d
|
Merge branch 'master' into dev-dns
|
2017-07-05 11:27:56 +02:00 |
|
Ai Lin Chia
|
2ec678d628
|
Remove now unused configuration to index spider reply. Remove commented out code. Remove now unused fields.
|
2017-07-05 11:22:26 +02:00 |
|
Ivan Skytte Jørgensen
|
c3013411c5
|
Query rewriting: domains
Queries of the form aaa.bbb.ccc or aaa.bbb are interpreted so that all terms and bigrams are required, and matches in URLs are given a boost.
|
2017-07-04 15:14:53 +02:00 |
|
Ivan Skytte Jørgensen
|
7ba8c92496
|
Changed Query*::m_fieldCode from char to an enum
|
2017-06-29 15:21:23 +02:00 |
|