<86>Apr 5 07:56:29 userdel[110672]: delete user 'rooter' <86>Apr 5 07:56:29 groupadd[110683]: group added to /etc/group: name=rooter, GID=645 <86>Apr 5 07:56:29 groupadd[110683]: group added to /etc/gshadow: name=rooter <86>Apr 5 07:56:29 groupadd[110683]: new group: name=rooter, GID=645 <86>Apr 5 07:56:29 useradd[110696]: new user: name=rooter, UID=645, GID=645, home=/root, shell=/bin/bash <86>Apr 5 07:56:29 userdel[110721]: delete user 'builder' <86>Apr 5 07:56:29 userdel[110721]: removed group 'builder' owned by 'builder' <86>Apr 5 07:56:29 userdel[110721]: removed shadow group 'builder' owned by 'builder' <86>Apr 5 07:56:29 groupadd[110734]: group added to /etc/group: name=builder, GID=646 <86>Apr 5 07:56:29 groupadd[110734]: group added to /etc/gshadow: name=builder <86>Apr 5 07:56:29 groupadd[110734]: new group: name=builder, GID=646 <86>Apr 5 07:56:29 useradd[110744]: new user: name=builder, UID=646, GID=646, home=/usr/src, shell=/bin/bash warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root <13>Apr 5 07:56:31 rpmi: rpm-macros-java-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Apr 5 07:56:37 rpmi: javapackages-tools-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Apr 5 07:56:37 rpmi: libjpeg-2:1.5.1-alt1 1498218318 installed <13>Apr 5 07:56:37 rpmi: libexpat-2.2.4-alt1 1503305345 installed <13>Apr 5 07:56:37 rpmi: libpng16-1.6.36-alt1 sisyphus+219478.100.1.1 1547633314 installed <13>Apr 5 07:56:38 rpmi: xorg-proto-devel-2018.4-alt3 1527685079 installed <13>Apr 5 07:56:38 rpmi: beust-jcommander-1.71-alt1_3jpp8 1523858260 installed <13>Apr 5 07:56:38 rpmi: xmvn-api-3.0.0-alt1_18jpp8 1527991448 installed <13>Apr 5 07:56:38 rpmi: xmvn-core-3.0.0-alt1_18jpp8 1527991448 installed <13>Apr 5 07:56:38 rpmi: xml-commons-apis-1.4.01-alt3_26jpp8 sisyphus+220521.100.1.1 1549294732 installed <13>Apr 5 07:56:38 rpmi: libICE-1.0.9-alt1 1409902721 installed <13>Apr 5 07:56:38 rpmi: libogg-1.3.3-alt1 sisyphus+221902.4000.4.1 1550598661 installed <13>Apr 5 07:56:38 rpmi: liblksctp-1.0.17-alt2 1523113261 installed <13>Apr 5 07:56:38 rpmi: libSM-1.2.3-alt1 sisyphus.215747.100 1540812795 installed <13>Apr 5 07:56:38 rpmi: java-common-1.5.0-alt1 1329330500 installed <13>Apr 5 07:56:38 rpmi: xml-utils-1:2.9.4.0.12.e905-alt1.1 1525115767 installed <13>Apr 5 07:56:38 rpmi: libgif-4.1.6-alt3 1299634261 installed <13>Apr 5 07:56:38 rpmi: libglvnd-7:1.1.1-alt1 sisyphus+224993.100.4.1 1552634542 installed <13>Apr 5 07:56:38 rpmi: libwayland-server-1.17.0-alt1 sisyphus+225894.100.1.1 1553872805 installed <13>Apr 5 07:56:38 rpmi: libalsa-1:1.1.8-alt2 sisyphus+221894.200.4.1 1550583286 installed <13>Apr 5 07:56:38 rpmi: javazi-2018i-alt1 sisyphus+221902.5300.4.1 1550600298 installed <13>Apr 5 07:56:38 rpmi: lksctp-tools-1.0.17-alt2 1523113261 installed <13>Apr 5 07:56:38 rpmi: libflac8-1.3.2-alt2 sisyphus+220898.4400.11.1 1551973321 installed <13>Apr 5 07:56:38 rpmi: libvorbis-1.3.6-alt2 sisyphus+220072.200.2.2 1548744475 installed <13>Apr 5 07:56:38 rpmi: libICE-devel-1.0.9-alt1 1409902721 installed <13>Apr 5 07:56:38 rpmi: libSM-devel-1.2.3-alt1 sisyphus.215747.100 1540812795 installed <13>Apr 5 07:56:38 rpmi: libjasper-2.0.14-alt1 1530105217 installed <13>Apr 5 07:56:38 rpmi: libtiff5-4.0.3-alt1 1348347501 installed <13>Apr 5 07:56:38 rpmi: ant-lib-0:1.10.3-alt1_2jpp8 1528243545 installed <13>Apr 5 07:56:38 rpmi: objenesis-0:2.6-alt1_1jpp8 1511395274 installed <13>Apr 5 07:56:38 rpmi: apache-commons-compress-0:1.16.1-alt1_1jpp8 1526491832 installed <13>Apr 5 07:56:38 rpmi: bcel-1:6.2-alt1_2jpp8 1525817590 installed <13>Apr 5 07:56:38 rpmi: slf4j-0:1.7.25-alt1_4jpp8 1525924634 installed <13>Apr 5 07:56:38 rpmi: zip-30000000:3.0-alt1 1332241772 installed <13>Apr 5 07:56:38 rpmi: sgml-common-0.6.3-alt15 1423664786 installed <13>Apr 5 07:56:38 rpmi: docbook-dtds-4.5-alt1 1223476557 installed <13>Apr 5 07:56:38 rpmi: docbook-style-xsl-1.79.1-alt2 sisyphus.213665.100 1537949315 installed <13>Apr 5 07:56:38 rpmi: libnatspec-0.3.1-alt2 1445691580 installed <13>Apr 5 07:56:38 rpmi: unzip-6.0-alt2.qa1 1366155324 installed <13>Apr 5 07:56:38 rpmi: libgdbm-1.8.3-alt10 1454943334 installed <13>Apr 5 07:56:38 rpmi: libgsm-1.0.17-alt1 1523356165 installed <13>Apr 5 07:56:38 rpmi: libsndfile-1.0.28-alt2 sisyphus.212728.100 1536333068 installed <13>Apr 5 07:56:38 rpmi: libasyncns-0.8-alt2.qa1 1365949820 installed <13>Apr 5 07:56:39 rpmi: libgtk+2-locales-2.24.32-alt2 1518699309 installed <13>Apr 5 07:56:39 rpmi: libdatrie-0.2.9-alt1_6 1511686676 installed <13>Apr 5 07:56:39 rpmi: libthai-0.1.28-alt1_1 sisyphus+226107.100.1.1 1554123079 installed <13>Apr 5 07:56:39 rpmi: libfribidi-1.0.5-alt1 1532424345 installed <13>Apr 5 07:56:39 rpmi: libpixman-3:0.38.0-alt1 sisyphus+221327.100.1.1 1549959657 installed <13>Apr 5 07:56:39 rpmi: libxshmfence-1.3-alt1 sisyphus+223149.1000.2.1 1551268571 installed <13>Apr 5 07:56:39 rpmi: libwayland-client-1.17.0-alt1 sisyphus+225894.100.1.1 1553872805 installed <13>Apr 5 07:56:39 rpmi: libpciaccess-1:0.14-alt1 1528969252 installed <13>Apr 5 07:56:39 rpmi: libdrm-1:2.4.97-alt1 sisyphus+220483.100.1.1 1549270242 installed <13>Apr 5 07:56:39 rpmi: libgbm-4:19.0.1-alt1 sisyphus+225820.100.1.1 1553767902 installed <13>Apr 5 07:56:39 rpmi: libatk-locales-2.32.0-alt1 sisyphus+225059.600.3.2 1552845198 installed <13>Apr 5 07:56:39 rpmi: libatk-2.32.0-alt1 sisyphus+225059.600.3.2 1552845362 installed <13>Apr 5 07:56:39 rpmi: libX11-locales-3:1.6.7-alt1 sisyphus.214413.200 1539171080 installed <13>Apr 5 07:56:39 rpmi: libXdmcp-1.1.3-alt1 sisyphus+225206.600.1.2 1552949353 installed <13>Apr 5 07:56:39 rpmi: libXau-1.0.9-alt1 sisyphus+223149.200.2.1 1551268152 installed <13>Apr 5 07:56:39 rpmi: libxcb-1.13.1-alt1 sisyphus.214413.100 1539170896 installed <13>Apr 5 07:56:39 rpmi: libX11-3:1.6.7-alt1 sisyphus.214413.200 1539171143 installed <13>Apr 5 07:56:39 rpmi: libXext-1.3.4-alt1 sisyphus+225206.700.1.2 1552949429 installed <13>Apr 5 07:56:39 rpmi: libXrender-0.9.8-alt1 1371312112 installed <13>Apr 5 07:56:39 rpmi: libXi-1.7.9-alt2 sisyphus+226377.100.1.2 1554260260 installed <13>Apr 5 07:56:39 rpmi: libXcomposite-0.4.5-alt1 sisyphus+225206.300.1.2 1552949137 installed <13>Apr 5 07:56:39 rpmi: libXfixes-5.0.3-alt1 sisyphus.216396.300 1542022162 installed <13>Apr 5 07:56:39 rpmi: libXtst-1.2.2-alt1 1369984893 installed <13>Apr 5 07:56:39 rpmi: libXdamage-1.1.5-alt1 sisyphus+225206.500.1.2 1552949286 installed <13>Apr 5 07:56:39 rpmi: libXcursor-1.2.0-alt1 sisyphus+225206.400.1.2 1552949218 installed <13>Apr 5 07:56:39 rpmi: libXrandr-1.5.2-alt1 sisyphus+225206.1300.1.2 1552949710 installed <13>Apr 5 07:56:39 rpmi: libXinerama-1.1.4-alt1 sisyphus+223149.300.2.1 1551268216 installed <13>Apr 5 07:56:39 rpmi: libXxf86vm-1.1.4-alt2 1527672187 installed <13>Apr 5 07:56:39 rpmi: libGLX-mesa-4:19.0.1-alt1 sisyphus+225820.100.1.1 1553767902 installed <13>Apr 5 07:56:39 rpmi: libEGL-mesa-4:19.0.1-alt1 sisyphus+225820.100.1.1 1553767902 installed <13>Apr 5 07:56:39 rpmi: libEGL-7:1.1.1-alt1 sisyphus+224993.100.4.1 1552634542 installed <13>Apr 5 07:56:39 rpmi: libGLX-7:1.1.1-alt1 sisyphus+224993.100.4.1 1552634542 installed <13>Apr 5 07:56:39 rpmi: libGL-7:1.1.1-alt1 sisyphus+224993.100.4.1 1552634542 installed <13>Apr 5 07:56:39 rpmi: libXt-1.1.4-alt1 1369984722 installed <13>Apr 5 07:56:40 rpmi: libxcb-devel-1.13.1-alt1 sisyphus.214413.100 1539170896 installed <13>Apr 5 07:56:40 rpmi: libX11-devel-3:1.6.7-alt1 sisyphus.214413.200 1539171143 installed <13>Apr 5 07:56:40 rpmi: libXt-devel-1.1.4-alt1 1369984722 installed <13>Apr 5 07:56:40 rpmi: libpcsclite-1.8.23-alt1 1513827863 installed <13>Apr 5 07:56:40 rpmi: libverto-0.3.0-alt1_7 sisyphus+225932.100.1.1 1553994919 installed <13>Apr 5 07:56:40 rpmi: libkeyutils-1.6-alt2 sisyphus.217337.100 1544003161 installed <13>Apr 5 07:56:40 rpmi: libcom_err-1.44.6-alt1 sisyphus+224154.100.1.1 1552091678 installed <13>Apr 5 07:56:40 rpmi: liblz4-1:1.8.3-alt2 sisyphus+221902.4200.4.1 1550599659 installed <13>Apr 5 07:56:40 rpmi: libgpg-error-1.36-alt1 sisyphus+225621.300.1.1 1553521082 installed <13>Apr 5 07:56:41 rpmi: libgcrypt20-1.8.4-alt1 sisyphus+225621.500.1.1 1553521735 installed <13>Apr 5 07:56:41 rpmi: libsystemd-1:241-alt4 sisyphus+226361.200.3.1 1554170427 installed <13>Apr 5 07:56:41 rpmi: libdbus-1.12.12-alt2 sisyphus+221234.100.1.2 1549918047 installed <13>Apr 5 07:56:41 rpmi: libavahi-0.6.32-alt1 1500485702 installed <13>Apr 5 07:56:41 rpmi: libcups-2.2.11-alt1 sisyphus+225793.100.1.1 1553701176 installed <13>Apr 5 07:56:41 rpmi: libpulseaudio-12.2-alt1 1535623585 installed <13>Apr 5 07:56:41 rpmi: libxslt-1.1.32-alt2 1517429984 installed <13>Apr 5 07:56:41 rpmi: libsqlite3-3.27.2-alt1 sisyphus+225506.100.1.1 1553253705 installed <13>Apr 5 07:56:41 rpmi: libnspr-1:4.21-alt1 sisyphus+226302.200.1.1 1554055346 installed <13>Apr 5 07:56:41 rpmi: libgraphite2-1.3.13-alt1 sisyphus.218545.100 1545686511 installed <13>Apr 5 07:56:41 rpmi: libharfbuzz-2.2.0-alt1 sisyphus.218134.500 1545261518 installed <13>Apr 5 07:56:41 rpmi: libfreetype-2.10.0-alt1 sisyphus+225205.100.1.2 1552930259 installed <13>Apr 5 07:56:41 rpmi: fontconfig-2.13.1-alt1 sisyphus.215917.100 1540973886 installed Updating fonts cache: <29>Apr 5 07:56:42 fontconfig: Updating fonts cache: succeeded [ DONE ] <13>Apr 5 07:56:42 rpmi: fonts-type1-xorg-7.0.0-alt4 1188553211 installed <13>Apr 5 07:56:42 rpmi: libcairo-1:1.16.0-alt1 sisyphus.215566.100 1540457683 installed <13>Apr 5 07:56:42 rpmi: libXft-2.3.3-alt1 sisyphus+225206.1000.3.2 1552987708 installed <13>Apr 5 07:56:42 rpmi: libpango-1.42.4-alt1 1534787259 installed <13>Apr 5 07:56:42 rpmi: icon-theme-hicolor-0.17-alt1 1505715846 installed <13>Apr 5 07:56:42 rpmi: shared-mime-info-1.12-alt1 sisyphus+219597.100.2.2 1548057005 installed <13>Apr 5 07:56:42 rpmi: libgdk-pixbuf-locales-2.38.1-alt1 sisyphus+223283.100.1.1 1551374215 installed <13>Apr 5 07:56:42 rpmi: gsettings-desktop-schemas-data-3.32.0-alt1 sisyphus+225059.300.3.2 1552843929 installed <13>Apr 5 07:56:42 rpmi: libgio-2.60.0-alt1 sisyphus+225059.100.3.2 1552843618 installed <13>Apr 5 07:56:42 rpmi: gsettings-desktop-schemas-3.32.0-alt1 sisyphus+225059.300.3.2 1552843980 installed <13>Apr 5 07:56:42 rpmi: libgdk-pixbuf-2.38.1-alt1 sisyphus+223283.100.1.1 1551374252 installed <13>Apr 5 07:56:42 rpmi: gtk-update-icon-cache-3.24.7-alt1 sisyphus+225059.1000.3.2 1552845896 installed <13>Apr 5 07:56:43 rpmi: libgtk+2-2.24.32-alt2 1518699309 installed <13>Apr 5 07:56:43 rpmi: libdbus-glib-1:0.106-alt1 1454672854 installed <13>Apr 5 07:56:43 rpmi: libp11-kit-0.23.15-alt1 sisyphus+226408.100.2.1 1554288204 installed <13>Apr 5 07:56:43 rpmi: libtasn1-4.13-alt2 1521133850 installed <13>Apr 5 07:56:43 rpmi: rpm-macros-alternatives-0.5.0-alt1 sisyphus+221902.300.4.1 1550587121 installed <13>Apr 5 07:56:43 rpmi: alternatives-0.5.0-alt1 sisyphus+221902.300.4.1 1550587121 installed <13>Apr 5 07:56:43 rpmi: libnss-3.43.0-alt1 sisyphus+226302.300.1.1 1554055693 installed <13>Apr 5 07:56:43 rpmi: ca-certificates-2019.03.31-alt1 sisyphus+226302.100.1.1 1554055265 installed <13>Apr 5 07:56:43 rpmi: ca-trust-0.1.1-alt2 1515595785 installed <13>Apr 5 07:56:43 rpmi: p11-kit-trust-0.23.15-alt1 sisyphus+226408.100.2.1 1554288204 installed <13>Apr 5 07:56:43 rpmi: libcrypto1.1-1.1.1b-alt1 sisyphus+225327.200.2.1 1553099317 installed <13>Apr 5 07:56:43 rpmi: libssl1.1-1.1.1b-alt1 sisyphus+225327.200.2.1 1553099317 installed <13>Apr 5 07:56:43 rpmi: libpython3-3.6.8-alt1 sisyphus+220164.200.3.1 1548842636 installed <13>Apr 5 07:56:43 rpmi: python3-3.6.8-alt1 sisyphus+220164.200.3.1 1548842636 installed <13>Apr 5 07:56:44 rpmi: python3-base-3.6.8-alt1 sisyphus+220164.200.3.1 1548842636 installed <13>Apr 5 07:56:44 rpmi: python3-module-sugarbowl-0.52.1-alt1.git20141130.1.1 1517983623 installed <13>Apr 5 07:56:44 rpmi: python3-module-six-1.12.0-alt1 sisyphus+219665.100.2.1 1548148570 installed <86>Apr 5 07:56:44 groupadd[14593]: group added to /etc/group: name=_keytab, GID=499 <86>Apr 5 07:56:44 groupadd[14593]: group added to /etc/gshadow: name=_keytab <86>Apr 5 07:56:44 groupadd[14593]: new group: name=_keytab, GID=499 <13>Apr 5 07:56:44 rpmi: libkrb5-1.16.3-alt1 sisyphus+223678.100.1.1 1551746516 installed <13>Apr 5 07:56:44 rpmi: ca-trust-java-0.1.1-alt2 1515595785 installed <13>Apr 5 07:56:46 rpmi: java-1.8.0-openjdk-headless-0:1.8.0.151-alt2_5.b12jpp8 sisyphus+221510.200.2.1 1550224825 installed <13>Apr 5 07:56:47 rpmi: java-1.8.0-openjdk-0:1.8.0.151-alt2_5.b12jpp8 sisyphus+221510.200.2.1 1550224825 installed <86>Apr 5 07:56:47 groupadd[25035]: group added to /etc/group: name=sasl, GID=498 <86>Apr 5 07:56:47 groupadd[25035]: group added to /etc/gshadow: name=sasl <86>Apr 5 07:56:47 groupadd[25035]: new group: name=sasl, GID=498 <13>Apr 5 07:56:47 rpmi: libsasl2-3-2.1.27-alt1 sisyphus+223971.100.1.1 1551928460 installed <13>Apr 5 07:56:47 rpmi: libldap-2.4.46-alt1.1 sisyphus+219907.4400.1.1 1548349979 installed <13>Apr 5 07:56:47 rpmi: libGConf-3.2.6-alt3 1455932638 installed <13>Apr 5 07:56:50 rpmi: java-1.7.0-openjdk-headless-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Apr 5 07:56:50 rpmi: java-1.7.0-openjdk-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Apr 5 07:56:51 rpmi: java-1.7.0-openjdk-devel-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Apr 5 07:56:51 rpmi: python3-module-markupsafe-0.23-alt1.2.1.1 1525118834 installed <13>Apr 5 07:56:51 rpmi: python3-module-jinja2-2.10-alt1 1521724576 installed <13>Apr 5 07:56:51 rpmi: python3-module-clyde-0.8.0-alt1.git20141130.2.1 1517980014 installed <13>Apr 5 07:56:51 rpmi: python3-module-pkg_resources-1:40.8.0-alt1 sisyphus+221229.100.2.1 1550559950 installed <13>Apr 5 07:56:51 rpmi: python3-module-runfile-0.46.1-alt1.git20141130.2.1 1517983182 installed <13>Apr 5 07:56:51 rpmi: objectweb-asm-0:6.1.1-alt1_1jpp8 1528136365 installed <13>Apr 5 07:56:51 rpmi: xmvn-install-3.0.0-alt1_18jpp8 1527991448 installed <13>Apr 5 07:56:51 rpmi: xmvn-subst-3.0.0-alt1_18jpp8 1527991448 installed <13>Apr 5 07:56:51 rpmi: xmvn-resolve-3.0.0-alt1_18jpp8 1527991448 installed <13>Apr 5 07:56:51 rpmi: xml-commons-resolver-0:1.2-alt1_24jpp8 1525932051 installed <13>Apr 5 07:56:51 rpmi: xalan-j2-0:2.7.1-alt4_34jpp8 1525931290 installed <13>Apr 5 07:56:52 rpmi: xerces-j2-0:2.11.0-alt3_31jpp8 1524211519 installed <13>Apr 5 07:56:52 rpmi: python3-module-genshi-0.7-alt1.1.1.1 1460400448 installed <13>Apr 5 07:56:52 rpmi: python3-module-webencodings-0.5.1-alt1.1 1517943573 installed <13>Apr 5 07:56:52 rpmi: python3-module-cssselect-0.9.1-alt1.2 1526980827 installed <13>Apr 5 07:56:52 rpmi: python3-module-html5lib-1:0.999999999-alt4.qa1 sisyphus.214868.100 1539741045 installed <13>Apr 5 07:56:52 rpmi: python3-module-lxml-4.3.3-alt1 sisyphus+225790.100.1.1 1553699239 installed <13>Apr 5 07:56:52 rpmi: python3-module-javapackages-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Apr 5 07:56:52 rpmi: rpm-build-java-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Apr 5 07:56:52 rpmi: java-stub-javadoc-0.1-alt1 1229813340 installed <13>Apr 5 07:56:52 rpmi: jpackage-generic-compat-0.29-alt1 1523537205 installed <13>Apr 5 07:56:52 rpmi: javapackages-local-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Apr 5 07:56:52 rpmi: nekohtml-0:1.9.22-alt1_6jpp8 1527988559 installed <13>Apr 5 07:56:53 rpmi: java-1.8.0-openjdk-devel-0:1.8.0.151-alt2_5.b12jpp8 sisyphus+221510.200.2.1 1550224825 installed <13>Apr 5 07:56:53 rpmi: ant-0:1.10.3-alt1_2jpp8 1528243545 installed Building target platforms: i586 Building for target i586 Wrote: /usr/src/in/nosrpm/boilerpipe-1.2.0-alt1_12jpp8.nosrc.rpm Installing boilerpipe-1.2.0-alt1_12jpp8.src.rpm Building target platforms: i586 Building for target i586 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.86415 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + rm -rf boilerpipe-1.2.0 + echo 'Source #0 (boilerpipe-1.2.0-src.tar.gz):' Source #0 (boilerpipe-1.2.0-src.tar.gz): + /bin/gzip -dc /usr/src/RPM/SOURCES/boilerpipe-1.2.0-src.tar.gz + /bin/tar -xf - + cd boilerpipe-1.2.0 + /bin/chmod -c -Rf u+rwX,go-w . + find . -iname '*.jar' -delete + find . -iname '*.class' -delete + echo 'Patch #0 (boilerpipe-1.2.0-libdir-patch):' Patch #0 (boilerpipe-1.2.0-libdir-patch): + /usr/bin/patch -p0 patching file build.xml + cp /usr/src/RPM/SOURCES/boilerpipe-1.2.0.pom pom.xml + echo 'Patch #1 (boilerpipe-1.2.0-nekohtml-patch):' Patch #1 (boilerpipe-1.2.0-nekohtml-patch): + /usr/bin/patch -p1 patching file pom.xml patching file src/main/org/cyberneko/html/HTMLElements.java patching file src/main/org/cyberneko/html/HTMLTagBalancer.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextBlock.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/document/TextDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/TagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.34048 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + ant -Dapp.javaversion=1.6 Buildfile: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml clean: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2 init: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist javadoc: [javadoc] Generating Javadoc [javadoc] Javadoc execution [javadoc] Loading source files for package de.l3s.boilerpipe... [javadoc] Loading source files for package de.l3s.boilerpipe.conditions... [javadoc] Loading source files for package de.l3s.boilerpipe.document... [javadoc] Loading source files for package de.l3s.boilerpipe.estimators... [javadoc] Loading source files for package de.l3s.boilerpipe.extractors... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.english... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.heuristics... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.simple... [javadoc] Loading source files for package de.l3s.boilerpipe.labels... [javadoc] Loading source files for package de.l3s.boilerpipe.sax... [javadoc] Loading source files for package de.l3s.boilerpipe.util... [javadoc] Constructing Javadoc information... [javadoc] Standard Doclet version 1.8.0_151 [javadoc] Building tree for all the packages and classes... [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:21: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:33: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:44: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:54: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeFilter.java:36: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeInput.java:32: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:33: warning: no description for @param [javadoc] * @param tb [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:34: error: malformed HTML [javadoc] * @return iff the condition is met. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextBlock.java:252: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextDocument.java:78: warning: no description for @param [javadoc] * @param title [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:46: warning: no description for @param [javadoc] * @param dsBefore [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:47: warning: no description for @param [javadoc] * @param dsAfter [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java:43: warning: no @return [javadoc] public static ArticleExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:47: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:64: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:83: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:98: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:109: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java:36: warning: no @return [javadoc] public static ArticleSentencesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java:43: warning: no @return [javadoc] public static CanolaExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java:37: warning: no @return [javadoc] public static DefaultExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java:42: warning: no @return [javadoc] public static LargestContentExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java:36: warning: no @return [javadoc] public static NumWordsRulesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java:43: warning: no @return [javadoc] public static DensityRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java:47: warning: no @return [javadoc] public static IgnoreBlocksAfterContentFilter getDefaultInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java:42: warning: no @return [javadoc] public static NumWordsRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java:40: warning: no @return [javadoc] public static TerminatingBlocksFinder getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:44: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:47: warning: no @param for labelPrefix [javadoc] public AddPrecedingLabelsFilter(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:55: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:57: warning: no @param for sameTagLevelOnly [javadoc] public BlockProximityFusion(final int maxBlocksDistance, [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java:40: warning: no @return [javadoc] public static ExpandTitleToContentFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:45: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:48: warning: no @param for labelPrefix [javadoc] public LabelFusion(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java:39: warning: no @return [javadoc] public static SimpleBlockFusionProcessor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java:39: warning: no @return [javadoc] public static BoilerplateBlockFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java:45: warning: no @return [javadoc] public static SplitParagraphBlocksFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java:47: warning: no description for @param [javadoc] * @param contentHandler [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:59: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:40: warning: no description for @param [javadoc] * @param is [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:41: warning: no description for @throws [javadoc] * @throws SAXException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:27: warning: no description for @param [javadoc] * @param url [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:28: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:29: warning: no description for @throws [javadoc] * @throws IOException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:54: warning: no @return [javadoc] public static HTMLHighlighter newHighlightingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:62: warning: no @return [javadoc] public static HTMLHighlighter newExtractingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:88: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:90: warning: no @return [javadoc] public String process(final TextDocument doc, final String origHTML) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:103: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:105: warning: no @return [javadoc] public String process(final TextDocument doc, final InputSource is) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:162: warning: no @return [javadoc] public boolean isOutputHighlightOnly() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:170: warning: no @param for outputHighlightOnly [javadoc] public void setOutputHighlightOnly(boolean outputHighlightOnly) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:181: warning: no @return [javadoc] public String getExtraStyleSheet() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:203: error: invalid entity &qupt; [javadoc] * <span class=&qupt;x-boilerpipe-mark1"> [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:205: warning: no @return [javadoc] public String getPreHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:215: warning: no @param for preHighlight [javadoc] public void setPreHighlight(String preHighlight) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:225: warning: no @return [javadoc] public String getPostHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:234: warning: no @param for postHighlight [javadoc] public void setPostHighlight(String postHighlight) { [javadoc] ^ [javadoc] Building index for all the packages and classes... [javadoc] Building index for all classes... [javadoc] Generating /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2/help-doc.html... [javadoc] 6 errors [javadoc] 56 warnings compile: [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:93: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 62 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:94: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 3 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning jars: [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-demo-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-javadoc-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-sources-1.2.0.jar dist: [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-bin.tar.gz [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/extractors/class-use/KeepEverythingWithMinKWordsExtractor.html longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/DensityRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/KeepLargestFulltextBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/MinFulltextWordsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/NumWordsRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/TerminatingBlocksFinder.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/AddPrecedingLabelsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ArticleMetadataFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/BlockProximityFusion.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/DocumentTitleMatchClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ExpandTitleToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/KeepLargestBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/SimpleBlockFusionProcessor.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/LabelToBoilerplateFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/MarkEverythingContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SplitParagraphBlocksFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SurroundingToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.BlockTagLabelAction.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.InlineTagLabelAction.html longer than 100 characters. [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-src.tar.gz [tar] Entry: boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands BUILD SUCCESSFUL Total time: 4 seconds + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.92191 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/boilerpipe-buildroot + : + /bin/rm -rf -- /usr/src/tmp/boilerpipe-buildroot + cd boilerpipe-1.2.0 + python3 /usr/share/java-utils/mvn_artifact.py pom.xml dist/boilerpipe-1.2.0.jar + python3 /usr/share/java-utils/mvn_file.py de.l3s.boilerpipe:boilerpipe boilerpipe + xmvn-install -R .xmvn-reactor -n boilerpipe -d /usr/src/tmp/boilerpipe-buildroot [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:pom:1.2.0 [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:jar:1.2.0 [INFO] Installation successful + jdir=javadoc/1.2 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/license + '[' -d javadoc/1.2 ']' + install -dm755 /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + cp -pr javadoc/1.2/allclasses-frame.html javadoc/1.2/allclasses-noframe.html javadoc/1.2/constant-values.html javadoc/1.2/de javadoc/1.2/deprecated-list.html javadoc/1.2/help-doc.html javadoc/1.2/index-all.html javadoc/1.2/index.html javadoc/1.2/overview-frame.html javadoc/1.2/overview-summary.html javadoc/1.2/overview-tree.html javadoc/1.2/package-list javadoc/1.2/script.js javadoc/1.2/serialized-form.html javadoc/1.2/stylesheet.css /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + echo /usr/share/javadoc/boilerpipe + install -pm 644 dist/boilerpipe-demo-1.2.0.jar /usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying and fixing files in /usr/src/tmp/boilerpipe-buildroot (binconfig,pkgconfig,libtool,desktop) Checking contents of files in /usr/src/tmp/boilerpipe-buildroot/ (default) Compressing files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying ELF objects in /usr/src/tmp/boilerpipe-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) Hardlinking identical .pyc and .pyo files Processing files: boilerpipe-1.2.0-alt1_12jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.59743 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.UuCYEP find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell) [INFO maven.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.prov] mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0 [INFO osgi.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.3DTcMW find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks) [INFO maven.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.req] javapackages-tools, mvn(net.sourceforge.nekohtml:nekohtml) [INFO osgi.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Provides: mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 Requires: javapackages-tools, mvn(net.sourceforge.nekohtml:nekohtml) Processing files: boilerpipe-javadoc-1.2.0-alt1_12jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.72796 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.LOuP48 find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.aUAyDJ find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks) Requires: javapackages-tools Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-1.2.0-alt1_12jpp8.noarch.rpm Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-javadoc-1.2.0-alt1_12jpp8.noarch.rpm 26.73user 1.87system 0:28.21elapsed 101%CPU (0avgtext+0avgdata 115576maxresident)k 0inputs+0outputs (0major+371581minor)pagefaults 0swaps 47.59user 6.52system 1:02.00elapsed 87%CPU (0avgtext+0avgdata 118932maxresident)k 68640inputs+0outputs (0major+1040204minor)pagefaults 0swaps