..
html | replaced http links with https | 2024-07-21 18:02:58 +02:00
images | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
rdfa | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
xml | always use HTTPClient by 'try with resources' pattern to free up | 2021-10-31 23:06:23 +01:00
AbstractCompressorParser.java | crawl profile adoption to new tag valency attribute | 2023-01-15 01:20:12 +01:00
apkParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
audioTagParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
bzipParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
csvParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
docParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
dwgParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader method. | 2016-02-16 02:05:58 +01:00
genericParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
GenericXMLParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
gzipParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
htmlParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
linkScraperParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
mmParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
odtParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
ooxmlParser.java | Improved parsing support for OOXML spreadsheets (.xlsx) | 2017-08-21 09:38:20 +02:00
pdfParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
pptParser.java | upgraded ppt parser by migration of org.apache.poi from 3.17 to 5.3.0 | 2024-07-21 15:28:13 +02:00
psParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader method. | 2016-02-16 02:05:58 +01:00
rdfParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
rssParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
rtfParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader method. | 2016-02-16 02:05:58 +01:00
sidAudioParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
sitemapParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
tarParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
torrentParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00
vcfParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader method. | 2016-02-16 02:05:58 +01:00
vsdParser.java | removed warnings | 2022-10-04 22:05:32 +02:00
xlsParser.java | removed warnings | 2022-10-04 22:05:32 +02:00
XZParser.java | code maintenance - removed warnings and replaced deprecated functions | 2024-11-25 12:29:11 +01:00
zipParser.java | replaced http links with https | 2024-07-21 18:02:58 +02:00