<86>Nov 29 07:51:19 userdel[24396]: delete user 'rooter' <86>Nov 29 07:51:19 userdel[24396]: removed group 'rooter' owned by 'rooter' <86>Nov 29 07:51:19 userdel[24396]: removed shadow group 'rooter' owned by 'rooter' <86>Nov 29 07:51:19 groupadd[24410]: group added to /etc/group: name=rooter, GID=588 <86>Nov 29 07:51:19 groupadd[24410]: group added to /etc/gshadow: name=rooter <86>Nov 29 07:51:19 groupadd[24410]: new group: name=rooter, GID=588 <86>Nov 29 07:51:19 useradd[24422]: new user: name=rooter, UID=588, GID=588, home=/root, shell=/bin/bash <86>Nov 29 07:51:19 userdel[24443]: delete user 'builder' <86>Nov 29 07:51:19 userdel[24443]: removed group 'builder' owned by 'builder' <86>Nov 29 07:51:19 userdel[24443]: removed shadow group 'builder' owned by 'builder' <86>Nov 29 07:51:19 groupadd[24464]: group added to /etc/group: name=builder, GID=589 <86>Nov 29 07:51:19 groupadd[24464]: group added to /etc/gshadow: name=builder <86>Nov 29 07:51:19 groupadd[24464]: new group: name=builder, GID=589 <86>Nov 29 07:51:19 useradd[24475]: new user: name=builder, UID=589, GID=589, home=/usr/src, shell=/bin/bash warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root warning: user igor does not exist - using root warning: group igor does not exist - using root <13>Nov 29 07:51:23 rpmi: rpm-macros-java-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Nov 29 07:51:27 rpmi: javapackages-tools-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Nov 29 07:51:27 rpmi: libjpeg-2:1.5.1-alt1 1498218318 installed <13>Nov 29 07:51:27 rpmi: libexpat-2.2.4-alt1 1503305345 installed <13>Nov 29 07:51:27 rpmi: libpng16-1.6.35-alt1 sisyphus.214397.100 1539159349 installed <13>Nov 29 07:51:27 rpmi: xorg-proto-devel-2018.4-alt3 1527685079 installed <13>Nov 29 07:51:27 rpmi: beust-jcommander-1.71-alt1_3jpp8 1523858260 installed <13>Nov 29 07:51:27 rpmi: xmvn-api-3.0.0-alt1_18jpp8 1527991448 installed <13>Nov 29 07:51:27 rpmi: xmvn-core-3.0.0-alt1_18jpp8 1527991448 installed <13>Nov 29 07:51:27 rpmi: xml-commons-apis-1.4.01-alt3_25jpp8 1524212083 installed <13>Nov 29 07:51:27 rpmi: libICE-1.0.9-alt1 1409902721 installed <13>Nov 29 07:51:27 rpmi: libogg-1.3.2-alt2 sisyphus.215919.100 1540973847 installed <13>Nov 29 07:51:27 rpmi: liblksctp-1.0.17-alt2 1523113261 installed <13>Nov 29 07:51:27 rpmi: libSM-1.2.3-alt1 sisyphus.215747.100 1540812795 installed <13>Nov 29 07:51:27 rpmi: java-common-1.5.0-alt1 1329330500 installed <13>Nov 29 07:51:28 rpmi: xml-utils-1:2.9.4.0.12.e905-alt1.1 1525115767 installed <13>Nov 29 07:51:28 rpmi: libgif-4.1.6-alt3 1299634261 installed <13>Nov 29 07:51:28 rpmi: libglvnd-7:1.1.0-alt3 sisyphus.215982.100 1541498632 installed <13>Nov 29 07:51:28 rpmi: libwayland-server-1.16.0-alt1 1535614871 installed <13>Nov 29 07:51:28 rpmi: libalsa-1:1.1.7-alt1 sisyphus.215150.100 1539797658 installed <13>Nov 29 07:51:28 rpmi: javazi-2018g-alt2 sisyphus.215795.200 1540855508 installed <13>Nov 29 07:51:28 rpmi: lksctp-tools-1.0.17-alt2 1523113261 installed <13>Nov 29 07:51:28 rpmi: libflac8-1.3.2-alt1 1507623955 installed <13>Nov 29 07:51:28 rpmi: libvorbis-1.3.6-alt1 1528307812 installed <13>Nov 29 07:51:28 rpmi: libICE-devel-1.0.9-alt1 1409902721 installed <13>Nov 29 07:51:28 rpmi: libSM-devel-1.2.3-alt1 sisyphus.215747.100 1540812795 installed <13>Nov 29 07:51:28 rpmi: libjasper-2.0.14-alt1 1530105217 installed <13>Nov 29 07:51:28 rpmi: libtiff5-4.0.3-alt1 1348347501 installed <13>Nov 29 07:51:28 rpmi: ant-lib-0:1.10.3-alt1_2jpp8 1528243545 installed <13>Nov 29 07:51:28 rpmi: objenesis-0:2.6-alt1_1jpp8 1511395274 installed <13>Nov 29 07:51:28 rpmi: apache-commons-compress-0:1.16.1-alt1_1jpp8 1526491832 installed <13>Nov 29 07:51:28 rpmi: bcel-1:6.2-alt1_2jpp8 1525817590 installed <13>Nov 29 07:51:28 rpmi: slf4j-0:1.7.25-alt1_4jpp8 1525924634 installed <13>Nov 29 07:51:28 rpmi: zip-30000000:3.0-alt1 1332241772 installed <13>Nov 29 07:51:28 rpmi: sgml-common-0.6.3-alt15 1423664786 installed <13>Nov 29 07:51:28 rpmi: docbook-dtds-4.5-alt1 1223476557 installed <13>Nov 29 07:51:29 rpmi: docbook-style-xsl-1.79.1-alt2 sisyphus.213665.100 1537949315 installed <13>Nov 29 07:51:29 rpmi: libnatspec-0.3.1-alt2 1445691580 installed <13>Nov 29 07:51:29 rpmi: unzip-6.0-alt2.qa1 1366155324 installed <13>Nov 29 07:51:29 rpmi: libgdbm-1.8.3-alt10 1454943334 installed <13>Nov 29 07:51:29 rpmi: libgsm-1.0.17-alt1 1523356165 installed <13>Nov 29 07:51:29 rpmi: libsndfile-1.0.28-alt2 sisyphus.212728.100 1536333068 installed <13>Nov 29 07:51:29 rpmi: libasyncns-0.8-alt2.qa1 1365949820 installed <13>Nov 29 07:51:29 rpmi: libgtk+2-locales-2.24.32-alt2 1518699309 installed <13>Nov 29 07:51:29 rpmi: libdatrie-0.2.9-alt1_6 1511686676 installed <13>Nov 29 07:51:29 rpmi: libthai-0.1.28-alt1_1 sisyphus.214516.100 1539257851 installed <13>Nov 29 07:51:29 rpmi: libfribidi-1.0.5-alt1 1532424345 installed <13>Nov 29 07:51:29 rpmi: libpixman-3:0.34.0-alt1 1480491657 installed <13>Nov 29 07:51:29 rpmi: libxshmfence-1.2-alt2 1518613552 installed <13>Nov 29 07:51:29 rpmi: libwayland-client-1.16.0-alt1 1535614871 installed <13>Nov 29 07:51:29 rpmi: libpciaccess-1:0.14-alt1 1528969252 installed <13>Nov 29 07:51:29 rpmi: libdrm-1:2.4.96-alt1 sisyphus.215486.100 1540374027 installed <13>Nov 29 07:51:29 rpmi: libgbm-4:18.2.5-alt1 sisyphus.216532.100 1542372718 installed <13>Nov 29 07:51:29 rpmi: libatk-locales-2.30.0-alt1 sisyphus.212779.100 1536768328 installed <13>Nov 29 07:51:29 rpmi: libatk-2.30.0-alt1 sisyphus.212779.100 1536768334 installed <13>Nov 29 07:51:29 rpmi: libX11-locales-3:1.6.7-alt1 sisyphus.214413.200 1539171080 installed <13>Nov 29 07:51:29 rpmi: libXdmcp-1.1.1-alt1 1334617701 installed <13>Nov 29 07:51:29 rpmi: libXau-1.0.8-alt1 1369565808 installed <13>Nov 29 07:51:29 rpmi: libxcb-1.13.1-alt1 sisyphus.214413.100 1539170896 installed <13>Nov 29 07:51:30 rpmi: libX11-3:1.6.7-alt1 sisyphus.214413.200 1539171143 installed <13>Nov 29 07:51:30 rpmi: libXext-1.3.3-alt1 1409902959 installed <13>Nov 29 07:51:30 rpmi: libXrender-0.9.8-alt1 1371312112 installed <13>Nov 29 07:51:30 rpmi: libXi-1.7.9-alt1.S1 1515755342 installed <13>Nov 29 07:51:30 rpmi: libXcomposite-0.4.3-alt3 1297306936 installed <13>Nov 29 07:51:30 rpmi: libXfixes-5.0.3-alt1 sisyphus.216396.300 1542022162 installed <13>Nov 29 07:51:30 rpmi: libXtst-1.2.2-alt1 1369984893 installed <13>Nov 29 07:51:30 rpmi: libXdamage-1.1.3-alt4 1297162593 installed <13>Nov 29 07:51:30 rpmi: libXcursor-1.1.15-alt1.S1 1512373366 installed <13>Nov 29 07:51:30 rpmi: libXrandr-1.5.0-alt1 1431936189 installed <13>Nov 29 07:51:30 rpmi: libXinerama-1.1.3-alt2 1527671619 installed <13>Nov 29 07:51:30 rpmi: libXxf86vm-1.1.4-alt2 1527672187 installed <13>Nov 29 07:51:30 rpmi: libGLX-mesa-4:18.2.5-alt1 sisyphus.216532.100 1542372718 installed <13>Nov 29 07:51:30 rpmi: libEGL-mesa-4:18.2.5-alt1 sisyphus.216532.100 1542372718 installed <13>Nov 29 07:51:30 rpmi: libEGL-7:1.1.0-alt3 sisyphus.215982.100 1541498632 installed <13>Nov 29 07:51:30 rpmi: libGLX-7:1.1.0-alt3 sisyphus.215982.100 1541498632 installed <13>Nov 29 07:51:30 rpmi: libGL-7:1.1.0-alt3 sisyphus.215982.100 1541498632 installed <13>Nov 29 07:51:30 rpmi: libXt-1.1.4-alt1 1369984722 installed <13>Nov 29 07:51:32 rpmi: libxcb-devel-1.13.1-alt1 sisyphus.214413.100 1539170896 installed <13>Nov 29 07:51:33 rpmi: libX11-devel-3:1.6.7-alt1 sisyphus.214413.200 1539171143 installed <13>Nov 29 07:51:33 rpmi: libXt-devel-1.1.4-alt1 1369984722 installed <13>Nov 29 07:51:33 rpmi: libpcsclite-1.8.23-alt1 1513827863 installed <13>Nov 29 07:51:33 rpmi: libverto-0.3.0-alt1_5 1525957714 installed <13>Nov 29 07:51:33 rpmi: libkeyutils-1.6-alt1 sisyphus.217029.100 1543414265 installed <13>Nov 29 07:51:33 rpmi: libcom_err-1.44.3-alt1 1532134732 installed <13>Nov 29 07:51:33 rpmi: liblz4-1:1.8.3-alt1 sisyphus.213737.100 1538009686 installed <13>Nov 29 07:51:33 rpmi: libgpg-error-1.31-alt1.S1 1529015802 installed <13>Nov 29 07:51:33 rpmi: libgcrypt20-1.8.3-alt3 sisyphus.214019.140 1538990448 installed <13>Nov 29 07:51:33 rpmi: libsystemd-1:239-alt3 sisyphus.215710.300 1540765641 installed <13>Nov 29 07:51:33 rpmi: libdbus-1.12.10-alt1 sisyphus.212941.100 1536831873 installed <13>Nov 29 07:51:33 rpmi: libavahi-0.6.32-alt1 1500485702 installed <13>Nov 29 07:51:33 rpmi: libpulseaudio-12.2-alt1 1535623585 installed <13>Nov 29 07:51:33 rpmi: libxslt-1.1.32-alt2 1517429984 installed <13>Nov 29 07:51:33 rpmi: libsqlite3-3.25.2-alt2 sisyphus.215082.100 1539700318 installed <13>Nov 29 07:51:33 rpmi: libnspr-1:4.20-alt1 sisyphus.216395.100 1542113039 installed <13>Nov 29 07:51:33 rpmi: libgraphite2-1.3.12-alt2.1 sisyphus.215942.100 1540990757 installed <13>Nov 29 07:51:33 rpmi: libharfbuzz-2.1.3-alt1 sisyphus.216837.100 1543085735 installed <13>Nov 29 07:51:33 rpmi: libfreetype-2.9.1-alt1.S1 1530781053 installed <13>Nov 29 07:51:33 rpmi: fontconfig-2.13.1-alt1 sisyphus.215917.100 1540973886 installed Updating fonts cache: <29>Nov 29 07:51:35 fontconfig: Updating fonts cache: succeeded [ DONE ] <13>Nov 29 07:51:35 rpmi: fonts-type1-xorg-7.0.0-alt4 1188553211 installed <13>Nov 29 07:51:35 rpmi: libcairo-1:1.16.0-alt1 sisyphus.215566.100 1540457683 installed <13>Nov 29 07:51:35 rpmi: libXft-2.3.2-alt1 1409902660 installed <13>Nov 29 07:51:35 rpmi: libpango-1.42.4-alt1 1534787259 installed <13>Nov 29 07:51:35 rpmi: icon-theme-hicolor-0.17-alt1 1505715846 installed <13>Nov 29 07:51:35 rpmi: libgdk-pixbuf-locales-2.38.0-alt2 sisyphus.213523.100 1537685512 installed <13>Nov 29 07:51:35 rpmi: shared-mime-info-1.10-alt1.1 1530525599 installed <13>Nov 29 07:51:35 rpmi: gsettings-desktop-schemas-data-3.28.1-alt1 sisyphus.212587.100 1536082062 installed <13>Nov 29 07:51:35 rpmi: libgio-2.58.1-alt3 sisyphus.214034.100 1538601697 installed <13>Nov 29 07:51:35 rpmi: gsettings-desktop-schemas-3.28.1-alt1 sisyphus.212587.100 1536082066 installed <13>Nov 29 07:51:35 rpmi: libgdk-pixbuf-2.38.0-alt2 sisyphus.213523.100 1537685557 installed <13>Nov 29 07:51:35 rpmi: gtk-update-icon-cache-3.24.1-alt1 sisyphus.213271.100 1537346078 installed <13>Nov 29 07:51:35 rpmi: libdbus-glib-1:0.106-alt1 1454672854 installed <13>Nov 29 07:51:35 rpmi: libtasn1-4.13-alt2 1521133850 installed <13>Nov 29 07:51:35 rpmi: libp11-kit-0.23.9-alt5 1525798298 installed <13>Nov 29 07:51:35 rpmi: rpm-macros-alternatives-0.4.5-alt1.1 1404382149 installed <13>Nov 29 07:51:35 rpmi: alternatives-0.4.5-alt1.1 1404382149 installed <13>Nov 29 07:51:36 rpmi: libnss-3.40.0-alt1 sisyphus.216395.200 1542113887 installed <13>Nov 29 07:51:36 rpmi: ca-certificates-2018.11.12-alt1 sisyphus.216395.300 1542114035 installed <13>Nov 29 07:51:36 rpmi: ca-trust-0.1.1-alt2 1515595785 installed <13>Nov 29 07:51:36 rpmi: p11-kit-trust-0.23.9-alt5 1525798298 installed <13>Nov 29 07:51:36 rpmi: libcrypto1.1-1.1.0j-alt1 sisyphus.216647.100 1542743878 installed <13>Nov 29 07:51:36 rpmi: libssl1.1-1.1.0j-alt1 sisyphus.216647.100 1542743878 installed <13>Nov 29 07:51:36 rpmi: libpython3-3.6.5-alt1.1 1535734576 installed <13>Nov 29 07:51:36 rpmi: rpm-build-python3-0.1.13.1-alt2 1535450458 installed <13>Nov 29 07:51:36 rpmi: tests-for-installed-python3-pkgs-0.1.13.1-alt2 1535450458 installed <13>Nov 29 07:51:36 rpmi: python3-3.6.5-alt1.1 1535734576 installed <13>Nov 29 07:51:37 rpmi: python3-base-3.6.5-alt1.1 1535734576 installed <86>Nov 29 07:51:37 groupadd[12499]: group added to /etc/group: name=_keytab, GID=499 <86>Nov 29 07:51:37 groupadd[12499]: group added to /etc/gshadow: name=_keytab <86>Nov 29 07:51:37 groupadd[12499]: new group: name=_keytab, GID=499 <13>Nov 29 07:51:37 rpmi: libkrb5-1.16.2-alt1 sisyphus.216047.100 1541159177 installed <13>Nov 29 07:51:37 rpmi: libcups-2.2.6-alt1 1510070343 installed <13>Nov 29 07:51:37 rpmi: python3-module-sugarbowl-0.52.1-alt1.git20141130.1.1 1517983623 installed <13>Nov 29 07:51:37 rpmi: python3-module-six-1.11.0-alt2 1535611135 installed <13>Nov 29 07:51:37 rpmi: ca-trust-java-0.1.1-alt2 1515595785 installed <13>Nov 29 07:51:42 rpmi: java-1.8.0-openjdk-headless-0:1.8.0.151-alt1_5.b12jpp8 1529924986 installed <13>Nov 29 07:51:43 rpmi: java-1.8.0-openjdk-0:1.8.0.151-alt1_5.b12jpp8 1529924986 installed <13>Nov 29 07:51:43 rpmi: libgtk+2-2.24.32-alt2 1518699309 installed <86>Nov 29 07:51:43 groupadd[20153]: group added to /etc/group: name=sasl, GID=498 <86>Nov 29 07:51:43 groupadd[20153]: group added to /etc/gshadow: name=sasl <86>Nov 29 07:51:43 groupadd[20153]: new group: name=sasl, GID=498 <13>Nov 29 07:51:43 rpmi: libsasl2-3-2.1.27-alt0.2 1535660695 installed <13>Nov 29 07:51:43 rpmi: libldap-2.4.46-alt1 1535562135 installed <13>Nov 29 07:51:43 rpmi: libGConf-3.2.6-alt3 1455932638 installed <13>Nov 29 07:51:47 rpmi: java-1.7.0-openjdk-headless-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Nov 29 07:51:48 rpmi: java-1.7.0-openjdk-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Nov 29 07:51:49 rpmi: java-1.7.0-openjdk-devel-0:1.7.0.181-alt1_2.6.14.8jpp8 1528046800 installed <13>Nov 29 07:51:49 rpmi: python3-module-markupsafe-0.23-alt1.2.1.1 1525118834 installed <13>Nov 29 07:51:49 rpmi: python3-module-jinja2-2.10-alt1 1521724576 installed <13>Nov 29 07:51:49 rpmi: python3-module-clyde-0.8.0-alt1.git20141130.2.1 1517980014 installed <13>Nov 29 07:51:49 rpmi: python3-module-pkg_resources-1:40.5.0-alt1 sisyphus.216029.100 1541106477 installed <13>Nov 29 07:51:49 rpmi: python3-module-runfile-0.46.1-alt1.git20141130.2.1 1517983182 installed <13>Nov 29 07:51:49 rpmi: objectweb-asm-0:6.1.1-alt1_1jpp8 1528136365 installed <13>Nov 29 07:51:50 rpmi: xmvn-install-3.0.0-alt1_18jpp8 1527991448 installed <13>Nov 29 07:51:50 rpmi: xmvn-subst-3.0.0-alt1_18jpp8 1527991448 installed <13>Nov 29 07:51:50 rpmi: xmvn-resolve-3.0.0-alt1_18jpp8 1527991448 installed <13>Nov 29 07:51:50 rpmi: xml-commons-resolver-0:1.2-alt1_24jpp8 1525932051 installed <13>Nov 29 07:51:50 rpmi: xalan-j2-0:2.7.1-alt4_34jpp8 1525931290 installed <13>Nov 29 07:51:50 rpmi: xerces-j2-0:2.11.0-alt3_31jpp8 1524211519 installed <13>Nov 29 07:51:50 rpmi: python3-module-genshi-0.7-alt1.1.1.1 1460400448 installed <13>Nov 29 07:51:50 rpmi: python3-module-webencodings-0.5.1-alt1.1 1517943573 installed <13>Nov 29 07:51:50 rpmi: python3-module-cssselect-0.9.1-alt1.2 1526980827 installed <13>Nov 29 07:51:50 rpmi: python3-module-html5lib-1:0.999999999-alt4.qa1 sisyphus.214868.100 1539741045 installed <13>Nov 29 07:51:50 rpmi: python3-module-lxml-4.2.1-alt1.1 1525119302 installed <13>Nov 29 07:51:50 rpmi: python3-module-javapackages-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Nov 29 07:51:50 rpmi: rpm-build-java-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Nov 29 07:51:50 rpmi: java-stub-javadoc-0.1-alt1 1229813340 installed <13>Nov 29 07:51:50 rpmi: jpackage-generic-compat-0.29-alt1 1523537205 installed <13>Nov 29 07:51:50 rpmi: javapackages-local-1:5.0.0-alt1_12jpp8 1525973129 installed <13>Nov 29 07:51:50 rpmi: nekohtml-0:1.9.22-alt1_6jpp8 1527988559 installed <13>Nov 29 07:51:52 rpmi: java-1.8.0-openjdk-devel-0:1.8.0.151-alt1_5.b12jpp8 1529924986 installed <13>Nov 29 07:51:52 rpmi: ant-0:1.10.3-alt1_2jpp8 1528243545 installed Building target platforms: i586 Building for target i586 Wrote: /usr/src/in/nosrpm/boilerpipe-1.2.0-alt1_11jpp8.nosrc.rpm Installing boilerpipe-1.2.0-alt1_11jpp8.src.rpm Building target platforms: i586 Building for target i586 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.10320 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + rm -rf boilerpipe-1.2.0 + echo 'Source #0 (boilerpipe-1.2.0-src.tar.gz):' Source #0 (boilerpipe-1.2.0-src.tar.gz): + /bin/gzip -dc /usr/src/RPM/SOURCES/boilerpipe-1.2.0-src.tar.gz + /bin/tar -xf - + cd boilerpipe-1.2.0 + /bin/chmod -c -Rf u+rwX,go-w . + find . -iname '*.jar' -delete + find . -iname '*.class' -delete + echo 'Patch #0 (boilerpipe-1.2.0-libdir-patch):' Patch #0 (boilerpipe-1.2.0-libdir-patch): + /usr/bin/patch -p0 patching file build.xml + cp /usr/src/RPM/SOURCES/boilerpipe-1.2.0.pom pom.xml + echo 'Patch #1 (boilerpipe-1.2.0-nekohtml-patch):' Patch #1 (boilerpipe-1.2.0-nekohtml-patch): + /usr/bin/patch -p1 patching file pom.xml patching file src/main/org/cyberneko/html/HTMLElements.java patching file src/main/org/cyberneko/html/HTMLTagBalancer.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextBlock.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/document/TextDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/TagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.13989 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + ant -Dapp.javaversion=1.6 Buildfile: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml clean: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2 init: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist javadoc: [javadoc] Generating Javadoc [javadoc] Javadoc execution [javadoc] Loading source files for package de.l3s.boilerpipe... [javadoc] Loading source files for package de.l3s.boilerpipe.conditions... [javadoc] Loading source files for package de.l3s.boilerpipe.document... [javadoc] Loading source files for package de.l3s.boilerpipe.estimators... [javadoc] Loading source files for package de.l3s.boilerpipe.extractors... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.english... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.heuristics... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.simple... [javadoc] Loading source files for package de.l3s.boilerpipe.labels... [javadoc] Loading source files for package de.l3s.boilerpipe.sax... [javadoc] Loading source files for package de.l3s.boilerpipe.util... [javadoc] Constructing Javadoc information... [javadoc] Standard Doclet version 1.8.0_151 [javadoc] Building tree for all the packages and classes... [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:21: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:33: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:44: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:54: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeFilter.java:36: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeInput.java:32: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:33: warning: no description for @param [javadoc] * @param tb [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:34: error: malformed HTML [javadoc] * @return iff the condition is met. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextBlock.java:252: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextDocument.java:78: warning: no description for @param [javadoc] * @param title [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:46: warning: no description for @param [javadoc] * @param dsBefore [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:47: warning: no description for @param [javadoc] * @param dsAfter [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java:43: warning: no @return [javadoc] public static ArticleExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:47: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:64: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:83: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:98: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:109: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java:36: warning: no @return [javadoc] public static ArticleSentencesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java:43: warning: no @return [javadoc] public static CanolaExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java:37: warning: no @return [javadoc] public static DefaultExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java:42: warning: no @return [javadoc] public static LargestContentExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java:36: warning: no @return [javadoc] public static NumWordsRulesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java:43: warning: no @return [javadoc] public static DensityRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java:47: warning: no @return [javadoc] public static IgnoreBlocksAfterContentFilter getDefaultInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java:42: warning: no @return [javadoc] public static NumWordsRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java:40: warning: no @return [javadoc] public static TerminatingBlocksFinder getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:44: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:47: warning: no @param for labelPrefix [javadoc] public AddPrecedingLabelsFilter(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:55: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:57: warning: no @param for sameTagLevelOnly [javadoc] public BlockProximityFusion(final int maxBlocksDistance, [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java:40: warning: no @return [javadoc] public static ExpandTitleToContentFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:45: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:48: warning: no @param for labelPrefix [javadoc] public LabelFusion(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java:39: warning: no @return [javadoc] public static SimpleBlockFusionProcessor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java:39: warning: no @return [javadoc] public static BoilerplateBlockFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java:45: warning: no @return [javadoc] public static SplitParagraphBlocksFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java:47: warning: no description for @param [javadoc] * @param contentHandler [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:59: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:40: warning: no description for @param [javadoc] * @param is [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:41: warning: no description for @throws [javadoc] * @throws SAXException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:27: warning: no description for @param [javadoc] * @param url [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:28: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:29: warning: no description for @throws [javadoc] * @throws IOException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:54: warning: no @return [javadoc] public static HTMLHighlighter newHighlightingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:62: warning: no @return [javadoc] public static HTMLHighlighter newExtractingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:88: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:90: warning: no @return [javadoc] public String process(final TextDocument doc, final String origHTML) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:103: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:105: warning: no @return [javadoc] public String process(final TextDocument doc, final InputSource is) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:162: warning: no @return [javadoc] public boolean isOutputHighlightOnly() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:170: warning: no @param for outputHighlightOnly [javadoc] public void setOutputHighlightOnly(boolean outputHighlightOnly) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:181: warning: no @return [javadoc] public String getExtraStyleSheet() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:203: error: invalid entity &qupt; [javadoc] * <span class=&qupt;x-boilerpipe-mark1"> [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:205: warning: no @return [javadoc] public String getPreHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:215: warning: no @param for preHighlight [javadoc] public void setPreHighlight(String preHighlight) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:225: warning: no @return [javadoc] public String getPostHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:234: warning: no @param for postHighlight [javadoc] public void setPostHighlight(String postHighlight) { [javadoc] ^ [javadoc] Building index for all the packages and classes... [javadoc] Building index for all classes... [javadoc] Generating /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2/help-doc.html... [javadoc] 6 errors [javadoc] 56 warnings compile: [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:93: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 62 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:94: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 3 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning jars: [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-demo-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-javadoc-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-sources-1.2.0.jar dist: [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-bin.tar.gz [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/extractors/class-use/KeepEverythingWithMinKWordsExtractor.html longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/DensityRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/KeepLargestFulltextBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/MinFulltextWordsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/NumWordsRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/TerminatingBlocksFinder.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/AddPrecedingLabelsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ArticleMetadataFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/BlockProximityFusion.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/DocumentTitleMatchClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ExpandTitleToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/KeepLargestBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/SimpleBlockFusionProcessor.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/LabelToBoilerplateFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/MarkEverythingContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SplitParagraphBlocksFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SurroundingToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.BlockTagLabelAction.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.InlineTagLabelAction.html longer than 100 characters. [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-src.tar.gz [tar] Entry: boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands BUILD SUCCESSFUL Total time: 6 seconds + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.85671 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/boilerpipe-buildroot + : + /bin/rm -rf -- /usr/src/tmp/boilerpipe-buildroot + cd boilerpipe-1.2.0 + python3 /usr/share/java-utils/mvn_artifact.py pom.xml dist/boilerpipe-1.2.0.jar + python3 /usr/share/java-utils/mvn_file.py de.l3s.boilerpipe:boilerpipe boilerpipe + xmvn-install -R .xmvn-reactor -n boilerpipe -d /usr/src/tmp/boilerpipe-buildroot [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:pom:1.2.0 [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:jar:1.2.0 [INFO] Installation successful + jdir=javadoc/1.2 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/license + '[' -d javadoc/1.2 ']' + install -dm755 /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + cp -pr javadoc/1.2/allclasses-frame.html javadoc/1.2/allclasses-noframe.html javadoc/1.2/constant-values.html javadoc/1.2/de javadoc/1.2/deprecated-list.html javadoc/1.2/help-doc.html javadoc/1.2/index-all.html javadoc/1.2/index.html javadoc/1.2/overview-frame.html javadoc/1.2/overview-summary.html javadoc/1.2/overview-tree.html javadoc/1.2/package-list javadoc/1.2/script.js javadoc/1.2/serialized-form.html javadoc/1.2/stylesheet.css /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + echo /usr/share/javadoc/boilerpipe + install -pm 644 dist/boilerpipe-demo-1.2.0.jar /usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying and fixing files in /usr/src/tmp/boilerpipe-buildroot (binconfig,pkgconfig,libtool,desktop) Checking contents of files in /usr/src/tmp/boilerpipe-buildroot/ (default) Compressing files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying ELF objects in /usr/src/tmp/boilerpipe-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) Hardlinking identical .pyc and .opt-?.pyc files Hardlinking identical .pyc and .pyo files Processing files: boilerpipe-1.2.0-alt1_11jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.32524 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.eiNqp2 find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,python3,shell) [INFO maven.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.prov] mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 [INFO osgi.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.zTBTaE find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks) [INFO maven.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.req] javapackages-tools, mvn(net.sourceforge.nekohtml:nekohtml) [INFO osgi.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Provides: mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 Requires: javapackages-tools, mvn(net.sourceforge.nekohtml:nekohtml) Processing files: boilerpipe-javadoc-1.2.0-alt1_11jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.55685 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.isQuLm find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,python3,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.6xPaus find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks) Requires: javapackages-tools Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-1.2.0-alt1_11jpp8.noarch.rpm Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-javadoc-1.2.0-alt1_11jpp8.noarch.rpm 48.59user 2.95system 0:42.83elapsed 120%CPU (0avgtext+0avgdata 129684maxresident)k 0inputs+0outputs (0major+398111minor)pagefaults 0swaps 90.37user 13.09system 1:37.62elapsed 105%CPU (0avgtext+0avgdata 129684maxresident)k 0inputs+0outputs (0major+1130966minor)pagefaults 0swaps