<86>Jul 25 10:37:41 userdel[4145889]: delete user 'rooter' <86>Jul 25 10:37:41 userdel[4145889]: removed group 'rooter' owned by 'rooter' <86>Jul 25 10:37:41 userdel[4145889]: removed shadow group 'rooter' owned by 'rooter' <86>Jul 25 10:37:41 groupadd[4145909]: group added to /etc/group: name=rooter, GID=681 <86>Jul 25 10:37:41 groupadd[4145909]: group added to /etc/gshadow: name=rooter <86>Jul 25 10:37:41 groupadd[4145909]: new group: name=rooter, GID=681 <86>Jul 25 10:37:41 useradd[4145928]: new user: name=rooter, UID=681, GID=681, home=/root, shell=/bin/bash <86>Jul 25 10:37:41 userdel[4145960]: delete user 'builder' <86>Jul 25 10:37:41 userdel[4145960]: removed group 'builder' owned by 'builder' <86>Jul 25 10:37:41 userdel[4145960]: removed shadow group 'builder' owned by 'builder' <86>Jul 25 10:37:41 groupadd[4145981]: group added to /etc/group: name=builder, GID=682 <86>Jul 25 10:37:41 groupadd[4145981]: group added to /etc/gshadow: name=builder <86>Jul 25 10:37:41 groupadd[4145981]: new group: name=builder, GID=682 <86>Jul 25 10:37:41 useradd[4146002]: new user: name=builder, UID=682, GID=682, home=/usr/src, shell=/bin/bash /usr/src/in/srpm/boilerpipe-1.2.0-alt1_13jpp8.src.rpm: license not found in '/usr/share/license' directory: ASL /usr/src/in/srpm/boilerpipe-1.2.0-alt1_13jpp8.src.rpm: license not found in '/usr/share/license' directory: 2.0 <13>Jul 25 10:37:42 rpmi: rpm-macros-java-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:37:45 rpmi: javapackages-filesystem-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:37:45 rpmi: javapackages-tools-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:37:45 rpmi: libjpeg-2:2.0.2-alt1 sisyphus+226996.100.1.1 1554902907 installed <13>Jul 25 10:37:45 rpmi: libpng16-1.6.37-alt1 sisyphus+236677.100.2.1 1566917998 installed <13>Jul 25 10:37:45 rpmi: liblcms2-2.11-alt1 sisyphus+253499.100.1.1 1592287020 installed <13>Jul 25 10:37:45 rpmi: libexpat-2.2.9-alt1 sisyphus+252464.200.2.1 1590958865 installed <13>Jul 25 10:37:45 rpmi: xorg-proto-devel-2020.1-alt1 sisyphus+250406.100.1.1 1587554810 installed <13>Jul 25 10:37:45 rpmi: perl-HTTP-Date-6.04-alt1 sisyphus+241046.100.1.1 1574192946 installed <13>Jul 25 10:37:45 rpmi: libICE-1.0.10-alt1 sisyphus+247690.100.1.1 1584000387 installed <13>Jul 25 10:37:45 rpmi: libSM-1.2.3-alt1 sisyphus+226734.100.2.1 1554586158 installed <13>Jul 25 10:37:45 rpmi: beust-jcommander-1.71-alt1_6jpp8 sisyphus+230680.100.1.3 1559093321 installed <13>Jul 25 10:37:45 rpmi: xmvn-api-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Jul 25 10:37:45 rpmi: xmvn-core-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Jul 25 10:37:45 rpmi: xml-commons-apis-1.4.01-alt3_29jpp8 sisyphus+246084.100.1.1 1581616535 installed <13>Jul 25 10:37:45 rpmi: liblksctp-1.0.17-alt2 1523113261 installed <13>Jul 25 10:37:45 rpmi: perl-XML-NamespaceSupport-1.12-alt1 1491296348 installed <13>Jul 25 10:37:45 rpmi: libsqlite3-3.32.3-alt1 sisyphus+253798.100.1.1 1592756163 installed <13>Jul 25 10:37:45 rpmi: libidn2-2.3.0-alt1 sisyphus+240846.100.1.2 1573870475 installed <13>Jul 25 10:37:45 rpmi: lksctp-tools-1.0.17-alt2 1523113261 installed <13>Jul 25 10:37:45 rpmi: java-common-1.6.0-alt1 sisyphus+234020.100.1.1 1562437039 installed <13>Jul 25 10:37:45 rpmi: xml-utils-1:2.9.10-alt3 sisyphus+245000.16400.79.1 1583230501 installed <13>Jul 25 10:37:45 rpmi: libpcsclite-1.9.0-alt1 sisyphus+253463.100.1.1 1592202070 installed <13>Jul 25 10:37:45 rpmi: javazi-2020a-alt1 sisyphus+250575.200.1.1 1587740494 installed <13>Jul 25 10:37:45 rpmi: libgif-4.1.6-alt3 1299634261 installed <13>Jul 25 10:37:45 rpmi: libusb-1.0.23-alt1 sisyphus+237317.100.1.1 1568059905 installed <13>Jul 25 10:37:45 rpmi: perl-LWP-MediaTypes-6.04-alt1 sisyphus+225468.100.1.1 1553186684 installed <13>Jul 25 10:37:45 rpmi: perl-Compress-Raw-Zlib-2.095-alt1 sisyphus+255277.100.1.1 1595512158 installed <13>Jul 25 10:37:45 rpmi: perl-libnet-1:3.11-alt1 1511423541 installed <13>Jul 25 10:37:45 rpmi: perl-XML-SAX-Base-1.09-alt1 1494364363 installed <13>Jul 25 10:37:45 rpmi: libfribidi-1.0.10-alt1 sisyphus+254557.100.1.1 1594020362 installed <13>Jul 25 10:37:45 rpmi: libepoxy-1.5.4-alt1 sisyphus+242061.100.1.1 1575190160 installed <13>Jul 25 10:37:45 rpmi: libnettle8-3.6-alt1 sisyphus+251637.100.3.1 1590060253 installed <13>Jul 25 10:37:45 rpmi: libpaper-1.1.26-alt1 sisyphus+221360.100.1.1 1549974197 installed <13>Jul 25 10:37:45 rpmi: libopenjpeg2.0-2.3.1-alt1 sisyphus+226454.100.1.1 1554284337 installed <13>Jul 25 10:37:45 rpmi: libnspr-1:4.26-alt1 sisyphus+254237.100.1.1 1593450639 installed <13>Jul 25 10:37:45 rpmi: libglvnd-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124268 installed <13>Jul 25 10:37:45 rpmi: libffi6-1:3.2.1-alt4 sisyphus+251953.300.2.1 1589891360 installed <13>Jul 25 10:37:45 rpmi: libwayland-client-1.18.0-alt1 sisyphus+245906.100.1.1 1581492503 installed <13>Jul 25 10:37:45 rpmi: libwayland-server-1.18.0-alt1 sisyphus+245906.100.1.1 1581492503 installed <13>Jul 25 10:37:45 rpmi: libp11-kit-0.23.15-alt2 sisyphus+252784.100.2.2 1591274901 installed <13>Jul 25 10:37:45 rpmi: libtasn1-4.16.0-alt1 sisyphus+245480.100.1.1 1580825062 installed <13>Jul 25 10:37:45 rpmi: libwayland-cursor-1.18.0-alt1 sisyphus+245906.100.1.1 1581492503 installed <13>Jul 25 10:37:45 rpmi: libwayland-egl-4:18.1.0-alt1 sisyphus+245906.100.1.1 1581492503 installed <13>Jul 25 10:37:45 rpmi: libhogweed6-3.6-alt1 sisyphus+251637.100.3.1 1590060253 installed <13>Jul 25 10:37:45 rpmi: libgnutls30-3.6.14-alt1 sisyphus+252951.100.1.1 1591438090 installed <13>Jul 25 10:37:45 rpmi: libICE-devel-1.0.10-alt1 sisyphus+247690.100.1.1 1584000387 installed <13>Jul 25 10:37:45 rpmi: libSM-devel-1.2.3-alt1 sisyphus+226734.100.2.1 1554586158 installed <13>Jul 25 10:37:45 rpmi: perl-File-Listing-6.04-alt1 1329758996 installed <13>Jul 25 10:37:45 rpmi: libqpdf28-10.0.1-alt1 sisyphus+249852.100.1.1 1586771283 installed <13>Jul 25 10:37:45 rpmi: libjasper-2.0.16-alt1 sisyphus+231386.100.1.1 1559568070 installed <13>Jul 25 10:37:45 rpmi: apache-commons-compress-0:1.18-alt1_4jpp8 sisyphus+230079.100.1.3 1558794575 installed <13>Jul 25 10:37:46 rpmi: ant-lib-0:1.10.5-alt1_5jpp8 sisyphus+232747.100.2.1 1561092977 installed <13>Jul 25 10:37:46 rpmi: bcel-1:6.2-alt1_4jpp8 sisyphus+234754.100.1.2 1563384029 installed <13>Jul 25 10:37:46 rpmi: slf4j-0:1.7.25-alt1_6jpp8 sisyphus+234787.100.1.2 1563401783 installed <13>Jul 25 10:37:46 rpmi: zip-30000000:3.0-alt1 1332241772 installed <13>Jul 25 10:37:46 rpmi: sgml-common-0.6.3-alt15 1423664786 installed <13>Jul 25 10:37:46 rpmi: docbook-dtds-4.5-alt1 1223476557 installed <13>Jul 25 10:37:46 rpmi: docbook-style-xsl-1.79.1-alt4 sisyphus+232871.100.1.1 1561238010 installed <13>Jul 25 10:37:46 rpmi: libnatspec-0.3.1-alt2 1445691580 installed <13>Jul 25 10:37:46 rpmi: unzip-6.0-alt3 sisyphus+244330.100.1.1 1579094108 installed <13>Jul 25 10:37:46 rpmi: libgdbm-1.8.3-alt10 1454943334 installed <13>Jul 25 10:37:46 rpmi: printer-testpages-2.0-alt2 1148643941 installed <13>Jul 25 10:37:46 rpmi: libgtk+2-locales-2.24.32-alt4 sisyphus+248211.200.2.1 1584869557 installed <13>Jul 25 10:37:46 rpmi: icon-theme-hicolor-0.17-alt2 sisyphus+248343.100.1.1 1584979043 installed <13>Jul 25 10:37:46 rpmi: libxkbcommon-0.10.0-alt1 sisyphus+244530.100.1.1 1579516274 installed <13>Jul 25 10:37:46 rpmi: libgudev-1:233-alt1 sisyphus+235422.100.1.1 1564855273 installed <13>Jul 25 10:37:46 rpmi: udev-rules-1:245.6-alt2 sisyphus+254630.100.3.1 1594643299 installed <13>Jul 25 10:37:46 rpmi: perl-Try-Tiny-0.30-alt1 1514318058 installed <13>Jul 25 10:37:46 rpmi: perl-IO-Socket-IP-0.39-alt1 1494508514 installed <13>Jul 25 10:37:46 rpmi: perl-Compress-Raw-Bzip2-2.095-alt1 sisyphus+255276.100.1.1 1595511686 installed <13>Jul 25 10:37:46 rpmi: perl-HTML-Tagset-3.20-alt2 1317725093 installed <13>Jul 25 10:37:46 rpmi: perl-Term-ANSIColor-5.01-alt1 sisyphus+244783.100.1.2 1579747505 installed <13>Jul 25 10:37:46 rpmi: perl-Data-Dump-1.23-alt1 1444601978 installed <13>Jul 25 10:37:46 rpmi: perl-Filter-1.59-alt1.1 sisyphus+219907.400.1.1 1548343389 installed <13>Jul 25 10:37:46 rpmi: perl-Encode-3.04-alt1 sisyphus+247835.100.1.1 1584190307 installed <13>Jul 25 10:37:46 rpmi: perl-URI-1.76-alt1 sisyphus+220243.100.1.1 1548863244 installed <13>Jul 25 10:37:46 rpmi: perl-IO-Compress-2.093-alt1 sisyphus+243543.100.1.1 1577294382 installed <13>Jul 25 10:37:46 rpmi: perl-Net-HTTP-6.19-alt1 sisyphus+229756.100.1.1 1558454558 installed <13>Jul 25 10:37:46 rpmi: perl-HTML-Parser-3.72-alt1.2 sisyphus+219907.600.1.1 1548343581 installed <13>Jul 25 10:37:46 rpmi: perl-WWW-RobotRules-6.02-alt1 1329756211 installed <13>Jul 25 10:37:46 rpmi: perl-Encode-Locale-1.05-alt1 1444608613 installed <13>Jul 25 10:37:46 rpmi: perl-IO-HTML-1.001-alt1 1404821752 installed <13>Jul 25 10:37:46 rpmi: perl-HTTP-Message-6.25-alt1 sisyphus+254521.100.1.1 1593894315 installed <13>Jul 25 10:37:46 rpmi: perl-HTTP-Cookies-6.08-alt1 sisyphus+242242.100.1.1 1575454022 installed <13>Jul 25 10:37:46 rpmi: perl-HTTP-Negotiate-6.01-alt1 1329760563 installed <13>Jul 25 10:37:46 rpmi: perl-libwww-6.46-alt1 sisyphus+254012.100.1.1 1593105927 installed <13>Jul 25 10:37:46 rpmi: perl-XML-LibXML-2.0202-alt1 sisyphus+246834.100.1.1 1582544040 installed <13>Jul 25 10:37:46 rpmi: perl-XML-SAX-1.02-alt1 sisyphus+232322.100.1.1 1560758406 installed <13>Jul 25 10:37:46 rpmi: perl-XML-Simple-2.25-alt1 1521437714 installed <13>Jul 25 10:37:46 rpmi: icon-naming-utils-0.8.90-alt1 1236573102 installed <13>Jul 25 10:37:47 rpmi: icon-theme-adwaita-3.36.1-alt1 sisyphus+250137.100.1.1 1587127395 installed <13>Jul 25 10:37:47 rpmi: libdatrie-0.2.9-alt1_6 1511686676 installed <13>Jul 25 10:37:47 rpmi: libthai-0.1.28-alt1_1 sisyphus+226107.100.1.1 1554123079 installed <13>Jul 25 10:37:47 rpmi: libgdk-pixbuf-locales-2.40.0-alt1 sisyphus+238952.140.2.1 1570644607 installed <13>Jul 25 10:37:47 rpmi: gtk+3-themes-incompatible-3.20-alt3 1461944560 installed <13>Jul 25 10:37:47 rpmi: libproxy-0.4.15-alt3.1 sisyphus+249308.100.1.1 1585930360 installed <13>Jul 25 10:37:47 rpmi: libwebp7-1.1.0-alt1 sisyphus+243895.100.1.1 1578410873 installed <13>Jul 25 10:37:47 rpmi: libjbig-2.1-alt1 1401380926 installed <13>Jul 25 10:37:47 rpmi: libtiff5-4.1.0-alt1 sisyphus+240802.100.1.1 1573743635 installed <13>Jul 25 10:37:47 rpmi: publicsuffix-list-dafsa-20200720-alt1 sisyphus+255208.100.1.1 1595349910 installed <13>Jul 25 10:37:47 rpmi: libpsl-0.21.1-alt1 sisyphus+255206.100.1.1 1595348938 installed <13>Jul 25 10:37:47 rpmi: libnghttp2-1.41.0-alt1 sisyphus+253680.100.1.1 1592642271 installed <13>Jul 25 10:37:47 rpmi: libverto-0.3.0-alt1_7 sisyphus+225932.100.1.1 1553994919 installed <13>Jul 25 10:37:47 rpmi: liblmdb-0.9.23-alt1 sisyphus+225277.100.2.1 1553001679 installed <13>Jul 25 10:37:47 rpmi: libkeyutils-1.6-alt2 sisyphus+226520.100.2.1 1554512089 installed <13>Jul 25 10:37:47 rpmi: libcom_err-1.44.6-alt1 sisyphus+224154.100.1.1 1552091678 installed <13>Jul 25 10:37:48 rpmi: poppler-data-0.4.9-alt1 sisyphus.216033.100 1541141723 installed <13>Jul 25 10:37:48 rpmi: libpixman-3:0.40.0-alt1 sisyphus+250700.100.1.1 1587971055 installed <13>Jul 25 10:37:48 rpmi: libbrotlicommon-1.0.7-alt1 sisyphus+226738.100.2.1 1554554568 installed <13>Jul 25 10:37:48 rpmi: libbrotlidec-1.0.7-alt1 sisyphus+226738.100.2.1 1554554568 installed <13>Jul 25 10:37:48 rpmi: libgraphite2-1.3.14-alt2 sisyphus+250009.100.1.1 1586943065 installed <13>Jul 25 10:37:48 rpmi: libharfbuzz-2.6.8-alt1 sisyphus+254028.100.1.1 1593106819 installed <13>Jul 25 10:37:48 rpmi: libfreetype-2.10.2-alt1 sisyphus+251736.100.1.1 1589531905 installed <13>Jul 25 10:37:48 rpmi: fontconfig-2.13.1-alt1 sisyphus+247349.100.1.2 1583841221 installed Updating fonts cache: <29>Jul 25 10:37:49 fontconfig: Updating fonts cache: succeeded [ DONE ] <13>Jul 25 10:37:49 rpmi: fonts-type1-xorg-7.0.0-alt4 1188553211 installed <13>Jul 25 10:37:49 rpmi: fonts-type1-urw-3:1.0.7pre44-alt3 sisyphus+224082.100.2.1 1552406640 installed <13>Jul 25 10:37:49 rpmi: libxshmfence-1.3-alt1 sisyphus+223149.1000.2.1 1551268571 installed <13>Jul 25 10:37:49 rpmi: libpciaccess-1:0.16-alt1 sisyphus+234814.100.1.1 1563438291 installed <13>Jul 25 10:37:49 rpmi: libdrm-1:2.4.102-alt1 sisyphus+252307.100.1.1 1590574828 installed <13>Jul 25 10:37:49 rpmi: libgbm-4:20.1.4-alt1 sisyphus+255250.100.1.1 1595485581 installed <13>Jul 25 10:37:49 rpmi: bc-1:1.07.1-alt1 sisyphus+221902.700.4.1 1550587857 installed <13>Jul 25 10:37:49 rpmi: libatk-locales-2.36.0-alt1 sisyphus+249208.100.1.1 1585840405 installed <13>Jul 25 10:37:49 rpmi: libatk-2.36.0-alt1 sisyphus+249208.100.1.1 1585840406 installed <13>Jul 25 10:37:49 rpmi: shared-mime-info-2.0-alt1 sisyphus+251302.100.1.1 1588847607 installed <13>Jul 25 10:37:50 rpmi: gsettings-desktop-schemas-data-3.36.1-alt1 sisyphus+250870.100.1.1 1588227108 installed <13>Jul 25 10:37:50 rpmi: libgio-2.64.4-alt1 sisyphus+254365.100.1.1 1593701078 installed <13>Jul 25 10:37:50 rpmi: gsettings-desktop-schemas-3.36.1-alt1 sisyphus+250870.100.1.1 1588227105 installed <13>Jul 25 10:37:50 rpmi: libgdk-pixbuf-2.40.0-alt1 sisyphus+238952.140.2.1 1570644615 installed <13>Jul 25 10:37:50 rpmi: gtk-update-icon-cache-3.24.21-alt1 sisyphus+254255.100.1.1 1593514352 installed <13>Jul 25 10:37:50 rpmi: libgusb-0.3.4-alt1 sisyphus+247875.100.1.1 1584292779 installed <13>Jul 25 10:37:50 rpmi: libcolord-1.4.4-alt2 sisyphus+229904.100.1.1 1558606569 installed <13>Jul 25 10:37:50 rpmi: libdconf-0.36.0-alt1 sisyphus+247780.1000.3.2 1584199861 installed <13>Jul 25 10:37:50 rpmi: libjson-glib-1.4.4-alt1 sisyphus.213175.100 1537249589 installed <13>Jul 25 10:37:50 rpmi: liblz4-1:1.9.2-alt1 sisyphus+238585.100.2.2 1570066927 installed <13>Jul 25 10:37:50 rpmi: libgpg-error-1.36-alt1 sisyphus+225621.300.1.1 1553521082 installed <13>Jul 25 10:37:50 rpmi: libgcrypt20-1.8.5-alt3 sisyphus+239622.100.1.1 1571746654 installed <13>Jul 25 10:37:50 rpmi: libsystemd-1:245.6-alt2 sisyphus+254630.100.3.1 1594643276 installed <13>Jul 25 10:37:50 rpmi: libdbus-1.12.18-alt1 sisyphus+252758.100.1.1 1591203693 installed <13>Jul 25 10:37:50 rpmi: libavahi-0.8-alt1 sisyphus+255349.240.4.1 1595604514 installed <13>Jul 25 10:37:50 rpmi: libcups-2.3.1-alt1 sisyphus+247381.100.2.2 1583841455 installed <13>Jul 25 10:37:51 rpmi: libgs-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568104012 installed <13>Jul 25 10:37:52 rpmi: ghostscript-common-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568103940 installed <13>Jul 25 10:37:52 rpmi: ghostscript-classic-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568104012 installed <13>Jul 25 10:37:52 rpmi: cups-filters-libs-1.27.5-alt1 sisyphus+253475.100.1.1 1592227620 installed <13>Jul 25 10:37:52 rpmi: libavahi-glib-0.8-alt1 sisyphus+255349.240.4.1 1595604514 installed <13>Jul 25 10:37:52 rpmi: dbus-tools-1.12.18-alt1 sisyphus+252758.100.1.1 1591203693 installed <86>Jul 25 10:37:52 groupadd[4177532]: group added to /etc/group: name=messagebus, GID=499 <86>Jul 25 10:37:52 groupadd[4177532]: group added to /etc/gshadow: name=messagebus <86>Jul 25 10:37:52 groupadd[4177532]: new group: name=messagebus, GID=499 <86>Jul 25 10:37:52 useradd[4177555]: new user: name=messagebus, UID=499, GID=499, home=/run/dbus, shell=/dev/null <13>Jul 25 10:37:52 rpmi: dbus-1.12.18-alt1 sisyphus+252758.100.1.1 1591203693 installed <13>Jul 25 10:37:52 rpmi: dconf-0.36.0-alt1 sisyphus+247780.1000.3.2 1584199861 installed <13>Jul 25 10:37:52 rpmi: libgtk+3-schemas-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed <13>Jul 25 10:37:52 rpmi: libpolkit-0.116-alt3 sisyphus+253546.100.1.1 1592424198 installed <86>Jul 25 10:37:52 groupadd[4177747]: group added to /etc/group: name=colord, GID=498 <86>Jul 25 10:37:52 groupadd[4177747]: group added to /etc/gshadow: name=colord <86>Jul 25 10:37:52 groupadd[4177747]: new group: name=colord, GID=498 <86>Jul 25 10:37:52 useradd[4177764]: new user: name=colord, UID=498, GID=498, home=/var/colord, shell=/dev/null <13>Jul 25 10:37:52 rpmi: colord-1.4.4-alt2 sisyphus+229904.100.1.1 1558606569 installed <13>Jul 25 10:37:52 rpmi: libxslt-1.1.34-alt2 sisyphus+248264.100.1.1 1584829770 installed <13>Jul 25 10:37:52 rpmi: libX11-locales-3:1.6.9-alt1 sisyphus+239210.100.1.1 1571056781 installed <13>Jul 25 10:37:52 rpmi: libXdmcp-1.1.3-alt1 sisyphus+225206.600.1.2 1552949353 installed <13>Jul 25 10:37:52 rpmi: libXau-1.0.9-alt1 sisyphus+223149.200.2.1 1551268152 installed <13>Jul 25 10:37:52 rpmi: libxcb-1.14-alt1 sisyphus+247358.200.1.3 1583854228 installed <13>Jul 25 10:37:52 rpmi: libX11-3:1.6.9-alt1 sisyphus+239210.100.1.1 1571056801 installed <13>Jul 25 10:37:52 rpmi: libXext-1.3.4-alt1 sisyphus+225206.700.1.2 1552949429 installed <13>Jul 25 10:37:52 rpmi: libXrender-0.9.8-alt1 1371312112 installed <13>Jul 25 10:37:52 rpmi: libXi-1.7.10-alt1 sisyphus+232786.300.1.1 1561106978 installed <13>Jul 25 10:37:52 rpmi: libXfixes-5.0.3-alt1 sisyphus+226736.100.2.2 1554614841 installed <13>Jul 25 10:37:52 rpmi: libXtst-1.2.2-alt1 1369984893 installed <13>Jul 25 10:37:52 rpmi: libXdamage-1.1.5-alt1 sisyphus+225206.500.1.2 1552949286 installed <13>Jul 25 10:37:52 rpmi: libXcomposite-0.4.5-alt1 sisyphus+225206.300.1.2 1552949137 installed <13>Jul 25 10:37:52 rpmi: libXcursor-1.2.0-alt1 sisyphus+225206.400.1.2 1552949218 installed <13>Jul 25 10:37:52 rpmi: libXrandr-1.5.2-alt1 sisyphus+225206.1300.1.2 1552949710 installed <13>Jul 25 10:37:52 rpmi: libXinerama-1.1.4-alt1 sisyphus+223149.300.2.1 1551268216 installed <13>Jul 25 10:37:52 rpmi: libat-spi2-core-2.36.0-alt1 sisyphus+247780.1600.3.2 1584200495 installed <13>Jul 25 10:37:52 rpmi: libXft-2.3.3-alt1 sisyphus+225206.1000.3.2 1552987708 installed <13>Jul 25 10:37:52 rpmi: libXxf86vm-1.1.4-alt2 1527672187 installed <13>Jul 25 10:37:52 rpmi: libGLX-mesa-4:20.1.4-alt1 sisyphus+255250.100.1.1 1595485581 installed <13>Jul 25 10:37:52 rpmi: libEGL-mesa-4:20.1.4-alt1 sisyphus+255250.100.1.1 1595485581 installed <13>Jul 25 10:37:52 rpmi: libEGL-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124268 installed <13>Jul 25 10:37:52 rpmi: libGLX-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124268 installed <13>Jul 25 10:37:52 rpmi: libGL-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124268 installed <13>Jul 25 10:37:52 rpmi: libcairo-1:1.16.0-alt1 sisyphus+226534.100.2.3 1554515535 installed <13>Jul 25 10:37:52 rpmi: libpango-1.44.7-alt1 sisyphus+239731.100.1.1 1571986949 installed <13>Jul 25 10:37:52 rpmi: libgtk+2-2.24.32-alt4 sisyphus+248211.200.2.1 1584869549 installed <13>Jul 25 10:37:52 rpmi: libgail-2.24.32-alt4 sisyphus+248211.200.2.1 1584869549 installed <13>Jul 25 10:37:52 rpmi: libcairo-gobject-1:1.16.0-alt1 sisyphus+226534.100.2.3 1554515535 installed <13>Jul 25 10:37:52 rpmi: dbus-tools-gui-1.12.18-alt1 sisyphus+252758.100.1.1 1591203693 installed <13>Jul 25 10:37:52 rpmi: at-spi2-core-2.36.0-alt1 sisyphus+247780.1600.3.2 1584200495 installed <13>Jul 25 10:37:52 rpmi: at-spi2-atk-2.34.2-alt1 sisyphus+247242.200.7.1 1583839952 installed <13>Jul 25 10:37:52 rpmi: libXt-1.2.0-alt1 sisyphus+247690.400.1.1 1584000596 installed <13>Jul 25 10:37:54 rpmi: libxcb-devel-1.14-alt1 sisyphus+247358.200.1.3 1583854228 installed <13>Jul 25 10:37:54 rpmi: libX11-devel-3:1.6.9-alt1 sisyphus+239210.100.1.1 1571056801 installed <13>Jul 25 10:37:54 rpmi: libXt-devel-1.2.0-alt1 sisyphus+247690.400.1.1 1584000596 installed <13>Jul 25 10:37:54 rpmi: rpm-macros-alternatives-0.5.1-alt1 sisyphus+226946.100.1.1 1554830426 installed <13>Jul 25 10:37:54 rpmi: alternatives-0.5.1-alt1 sisyphus+226946.100.1.1 1554830426 installed <13>Jul 25 10:37:54 rpmi: libnss-3.54.0-alt1 sisyphus+254237.200.1.1 1593450935 installed <13>Jul 25 10:37:54 rpmi: ca-certificates-2020.06.29-alt1 sisyphus+254237.300.1.1 1593450881 installed <13>Jul 25 10:37:54 rpmi: ca-trust-0.1.2-alt1 sisyphus+233348.100.1.1 1561653823 installed <13>Jul 25 10:37:54 rpmi: p11-kit-trust-0.23.15-alt2 sisyphus+252784.100.2.2 1591274901 installed <13>Jul 25 10:37:54 rpmi: libcrypto1.1-1.1.1g-alt1 sisyphus+249982.60.8.1 1587743711 installed <13>Jul 25 10:37:54 rpmi: libssl1.1-1.1.1g-alt1 sisyphus+249982.60.8.1 1587743711 installed <13>Jul 25 10:37:54 rpmi: libpython3-3.8.5-alt1 sisyphus+244405.100.3.1 1595544514 installed <13>Jul 25 10:37:54 rpmi: python3-3.8.5-alt1 sisyphus+244405.100.3.1 1595544514 installed <13>Jul 25 10:37:55 rpmi: python3-base-3.8.5-alt1 sisyphus+244405.100.3.1 1595544514 installed <86>Jul 25 10:37:55 groupadd[4187318]: group added to /etc/group: name=_keytab, GID=497 <86>Jul 25 10:37:55 groupadd[4187318]: group added to /etc/gshadow: name=_keytab <86>Jul 25 10:37:55 groupadd[4187318]: new group: name=_keytab, GID=497 <13>Jul 25 10:37:55 rpmi: libkrb5-1.18.2-alt2 sisyphus+254565.100.4.1 1594375666 installed <13>Jul 25 10:37:55 rpmi: python3-module-sugarbowl-0.52.1-alt1.git20141130.1.1 sisyphus+227470.1100.1.1 1555687657 installed <13>Jul 25 10:37:55 rpmi: python3-module-six-1.14.0-alt1 sisyphus+251567.100.1.1 1589268039 installed <13>Jul 25 10:37:55 rpmi: ca-trust-java-0.1.2-alt1 sisyphus+233348.100.1.1 1561653823 installed <13>Jul 25 10:37:58 rpmi: java-1.8.0-openjdk-headless-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+234504.100.1.1 1563098253 installed <86>Jul 25 10:37:58 groupadd[3743]: group added to /etc/group: name=sasl, GID=496 <86>Jul 25 10:37:58 groupadd[3743]: group added to /etc/gshadow: name=sasl <86>Jul 25 10:37:58 groupadd[3743]: new group: name=sasl, GID=496 <13>Jul 25 10:37:58 rpmi: libsasl2-3-2.1.27-alt2 sisyphus+228101.100.1.1 1556139863 installed <13>Jul 25 10:37:58 rpmi: libldap-2.4.48-alt3 sisyphus+238816.100.1.1 1570449022 installed <13>Jul 25 10:37:58 rpmi: libcurl-7.71.1-alt1 sisyphus+254403.100.1.1 1593776636 installed <13>Jul 25 10:37:58 rpmi: libpoppler97-0.86.1-alt1 sisyphus+247631.100.1.1 1583927460 installed <13>Jul 25 10:37:59 rpmi: poppler-0.86.1-alt1 sisyphus+247631.100.1.1 1583927460 installed <13>Jul 25 10:37:59 rpmi: libpoppler0-cpp-0.86.1-alt1 sisyphus+247631.100.1.1 1583927460 installed <13>Jul 25 10:37:59 rpmi: cups-filters-1.27.5-alt1 sisyphus+253475.100.1.1 1592227620 installed <13>Jul 25 10:37:59 rpmi: cups-2.3.1-alt1 sisyphus+247381.100.2.2 1583841455 installed <13>Jul 25 10:38:02 rpmi: java-10-openjdk-headless-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562726544 installed <13>Jul 25 10:38:03 rpmi: python3-module-markupsafe-1.1.1-alt1 sisyphus+248369.100.1.1 1585046136 installed <13>Jul 25 10:38:03 rpmi: python3-module-jinja2-2.11.2-alt1 sisyphus+254573.100.1.1 1594043344 installed <13>Jul 25 10:38:03 rpmi: python3-module-clyde-0.8.0-alt1.git20141130.2.1 sisyphus+227465.1600.1.2 1555756906 installed <13>Jul 25 10:38:03 rpmi: python3-module-pkg_resources-1:46.1.3-alt1 sisyphus+250566.200.3.1 1587973342 installed <13>Jul 25 10:38:03 rpmi: python3-module-runfile-0.46.1-alt1.git20141130.2.1 sisyphus+227469.1300.2.3 1555706376 installed <13>Jul 25 10:38:03 rpmi: objectweb-asm-0:7.0-alt1_4jpp8 sisyphus+246362.100.1.3 1581801326 installed <13>Jul 25 10:38:03 rpmi: xmvn-install-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Jul 25 10:38:03 rpmi: xmvn-subst-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Jul 25 10:38:03 rpmi: xmvn-resolve-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Jul 25 10:38:03 rpmi: xml-commons-resolver-0:1.2-alt1_29jpp8 sisyphus+246085.100.1.1 1581616616 installed <13>Jul 25 10:38:03 rpmi: xalan-j2-0:2.7.1-alt4_39jpp8 sisyphus+230759.100.1.3 1559127607 installed <13>Jul 25 10:38:03 rpmi: xerces-j2-0:2.12.0-alt1_4jpp8 sisyphus+246082.100.1.1 1581615230 installed <13>Jul 25 10:38:03 rpmi: python3-module-genshi-0.7-alt2 sisyphus+229363.100.1.1 1557847335 installed <13>Jul 25 10:38:03 rpmi: python3-module-webencodings-0.5.1-alt2 sisyphus+245915.200.1.1 1581496105 installed <13>Jul 25 10:38:03 rpmi: python3-module-cssselect-0.9.1-alt2 sisyphus+250566.2300.6.1 1588188959 installed <13>Jul 25 10:38:03 rpmi: python3-module-html5lib-1:1.0.1-alt1 sisyphus+238807.100.2.1 1570465973 installed <13>Jul 25 10:38:03 rpmi: python3-module-lxml-4.5.0-alt2 sisyphus+250566.2700.6.1 1588189778 installed <13>Jul 25 10:38:04 rpmi: python3-module-javapackages-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:38:04 rpmi: rpm-build-java-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:38:04 rpmi: glib-networking-2.64.3-alt1 sisyphus+251581.1300.3.3 1590773671 installed <13>Jul 25 10:38:04 rpmi: libsoup-2.70.0-alt1 sisyphus+247780.1300.3.2 1584200094 installed <13>Jul 25 10:38:04 rpmi: libsoup-gnome-2.70.0-alt1 sisyphus+247780.1300.3.2 1584200094 installed <13>Jul 25 10:38:04 rpmi: librest-0.8.1-alt1 1508266400 installed <13>Jul 25 10:38:04 rpmi: libgtk+3-3.24.21-alt1 sisyphus+254255.100.1.1 1593514352 installed <13>Jul 25 10:38:04 rpmi: gtk3-demo-3.24.21-alt1 sisyphus+254255.100.1.1 1593514352 installed <13>Jul 25 10:38:04 rpmi: libgail3-3.24.21-alt1 sisyphus+254255.100.1.1 1593514352 installed <13>Jul 25 10:38:04 rpmi: java-stub-javadoc-0.1-alt1 1229813340 installed <13>Jul 25 10:38:04 rpmi: alsa-ucm-conf-1.2.3-alt1 sisyphus+253139.200.1.1 1591812001 installed <13>Jul 25 10:38:04 rpmi: alsa-topology-conf-1.2.3-alt1 sisyphus+253139.100.1.1 1591811985 installed <13>Jul 25 10:38:04 rpmi: libalsa-1:1.2.3.2-alt1 sisyphus+254690.100.1.1 1594280126 installed <13>Jul 25 10:38:04 rpmi: java-1.8.0-openjdk-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+234504.100.1.1 1563098253 installed <13>Jul 25 10:38:05 rpmi: java-1.8.0-openjdk-devel-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+234504.100.1.1 1563098253 installed <13>Jul 25 10:38:05 rpmi: java-10-openjdk-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562726544 installed <13>Jul 25 10:38:06 rpmi: java-10-openjdk-devel-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562726544 installed <13>Jul 25 10:38:06 rpmi: jpackage-generic-compat-0.30-alt1 sisyphus+234288.100.1.1 1562847521 installed <13>Jul 25 10:38:06 rpmi: javapackages-local-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Jul 25 10:38:06 rpmi: nekohtml-0:1.9.22-alt1_11jpp8 sisyphus+246358.100.1.1 1581799490 installed <13>Jul 25 10:38:06 rpmi: ant-0:1.10.5-alt1_5jpp8 sisyphus+232747.100.2.1 1561092977 installed Building target platforms: i586 Building for target i586 Wrote: /usr/src/in/nosrpm/boilerpipe-1.2.0-alt1_13jpp8.nosrc.rpm Installing boilerpipe-1.2.0-alt1_13jpp8.src.rpm Building target platforms: i586 Building for target i586 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.65928 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + rm -rf boilerpipe-1.2.0 + echo 'Source #0 (boilerpipe-1.2.0-src.tar.gz):' Source #0 (boilerpipe-1.2.0-src.tar.gz): + /bin/gzip -dc /usr/src/RPM/SOURCES/boilerpipe-1.2.0-src.tar.gz + /bin/tar -xf - + cd boilerpipe-1.2.0 + /bin/chmod -c -Rf u+rwX,go-w . + find . -iname '*.jar' -delete + find . -iname '*.class' -delete + echo 'Patch #0 (boilerpipe-1.2.0-libdir-patch):' Patch #0 (boilerpipe-1.2.0-libdir-patch): + /usr/bin/patch -p0 patching file build.xml + cp /usr/src/RPM/SOURCES/boilerpipe-1.2.0.pom pom.xml + echo 'Patch #1 (boilerpipe-1.2.0-nekohtml-patch):' Patch #1 (boilerpipe-1.2.0-nekohtml-patch): + /usr/bin/patch -p1 patching file pom.xml patching file src/main/org/cyberneko/html/HTMLElements.java patching file src/main/org/cyberneko/html/HTMLTagBalancer.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextBlock.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/document/TextDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/TagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.68118 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + ant -Dapp.javaversion=1.6 Buildfile: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml clean: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2 init: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist javadoc: [javadoc] Generating Javadoc [javadoc] Javadoc execution [javadoc] Loading source files for package de.l3s.boilerpipe... [javadoc] Loading source files for package de.l3s.boilerpipe.conditions... [javadoc] Loading source files for package de.l3s.boilerpipe.document... [javadoc] Loading source files for package de.l3s.boilerpipe.estimators... [javadoc] Loading source files for package de.l3s.boilerpipe.extractors... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.english... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.heuristics... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.simple... [javadoc] Loading source files for package de.l3s.boilerpipe.labels... [javadoc] Loading source files for package de.l3s.boilerpipe.sax... [javadoc] Loading source files for package de.l3s.boilerpipe.util... [javadoc] Constructing Javadoc information... [javadoc] Standard Doclet version 1.8.0_212 [javadoc] Building tree for all the packages and classes... [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:21: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:33: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:44: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:54: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeFilter.java:36: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeInput.java:32: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:33: warning: no description for @param [javadoc] * @param tb [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:34: error: malformed HTML [javadoc] * @return iff the condition is met. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextBlock.java:252: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextDocument.java:78: warning: no description for @param [javadoc] * @param title [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:46: warning: no description for @param [javadoc] * @param dsBefore [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:47: warning: no description for @param [javadoc] * @param dsAfter [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java:43: warning: no @return [javadoc] public static ArticleExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:47: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:64: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:83: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:98: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:109: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java:36: warning: no @return [javadoc] public static ArticleSentencesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java:43: warning: no @return [javadoc] public static CanolaExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java:37: warning: no @return [javadoc] public static DefaultExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java:42: warning: no @return [javadoc] public static LargestContentExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java:36: warning: no @return [javadoc] public static NumWordsRulesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java:43: warning: no @return [javadoc] public static DensityRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java:47: warning: no @return [javadoc] public static IgnoreBlocksAfterContentFilter getDefaultInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java:42: warning: no @return [javadoc] public static NumWordsRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java:40: warning: no @return [javadoc] public static TerminatingBlocksFinder getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:44: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:47: warning: no @param for labelPrefix [javadoc] public AddPrecedingLabelsFilter(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:55: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:57: warning: no @param for sameTagLevelOnly [javadoc] public BlockProximityFusion(final int maxBlocksDistance, [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java:40: warning: no @return [javadoc] public static ExpandTitleToContentFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:45: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:48: warning: no @param for labelPrefix [javadoc] public LabelFusion(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java:39: warning: no @return [javadoc] public static SimpleBlockFusionProcessor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java:39: warning: no @return [javadoc] public static BoilerplateBlockFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java:45: warning: no @return [javadoc] public static SplitParagraphBlocksFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java:47: warning: no description for @param [javadoc] * @param contentHandler [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:59: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:40: warning: no description for @param [javadoc] * @param is [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:41: warning: no description for @throws [javadoc] * @throws SAXException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:27: warning: no description for @param [javadoc] * @param url [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:28: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:29: warning: no description for @throws [javadoc] * @throws IOException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:54: warning: no @return [javadoc] public static HTMLHighlighter newHighlightingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:62: warning: no @return [javadoc] public static HTMLHighlighter newExtractingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:88: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:90: warning: no @return [javadoc] public String process(final TextDocument doc, final String origHTML) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:103: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:105: warning: no @return [javadoc] public String process(final TextDocument doc, final InputSource is) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:162: warning: no @return [javadoc] public boolean isOutputHighlightOnly() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:170: warning: no @param for outputHighlightOnly [javadoc] public void setOutputHighlightOnly(boolean outputHighlightOnly) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:181: warning: no @return [javadoc] public String getExtraStyleSheet() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:203: error: invalid entity &qupt; [javadoc] * <span class=&qupt;x-boilerpipe-mark1"> [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:205: warning: no @return [javadoc] public String getPreHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:215: warning: no @param for preHighlight [javadoc] public void setPreHighlight(String preHighlight) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:225: warning: no @return [javadoc] public String getPostHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:234: warning: no @param for postHighlight [javadoc] public void setPostHighlight(String postHighlight) { [javadoc] ^ [javadoc] Building index for all the packages and classes... [javadoc] Building index for all classes... [javadoc] Generating /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2/help-doc.html... [javadoc] 6 errors [javadoc] 56 warnings compile: [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:93: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 62 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:94: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 3 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning jars: [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-demo-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-javadoc-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-sources-1.2.0.jar dist: [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-bin.tar.gz [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/extractors/class-use/KeepEverythingWithMinKWordsExtractor.html longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/DensityRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/KeepLargestFulltextBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/MinFulltextWordsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/NumWordsRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/TerminatingBlocksFinder.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/AddPrecedingLabelsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ArticleMetadataFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/BlockProximityFusion.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/DocumentTitleMatchClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ExpandTitleToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/KeepLargestBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/SimpleBlockFusionProcessor.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/LabelToBoilerplateFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/MarkEverythingContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SplitParagraphBlocksFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SurroundingToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.BlockTagLabelAction.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.InlineTagLabelAction.html longer than 100 characters. [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-src.tar.gz [tar] Entry: boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands BUILD SUCCESSFUL Total time: 12 seconds + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.43064 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/boilerpipe-buildroot + : + /bin/rm -rf -- /usr/src/tmp/boilerpipe-buildroot + cd boilerpipe-1.2.0 + /usr/bin/python3 /usr/share/java-utils/mvn_artifact.py pom.xml dist/boilerpipe-1.2.0.jar + /usr/bin/python3 /usr/share/java-utils/mvn_file.py de.l3s.boilerpipe:boilerpipe boilerpipe + xmvn-install -R .xmvn-reactor -n boilerpipe -d /usr/src/tmp/boilerpipe-buildroot [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:pom:1.2.0 [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:jar:1.2.0 [INFO] Installation successful + jdir=javadoc/1.2 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/license + '[' -d javadoc/1.2 ']' + install -dm755 /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + cp -pr javadoc/1.2/allclasses-frame.html javadoc/1.2/allclasses-noframe.html javadoc/1.2/constant-values.html javadoc/1.2/de javadoc/1.2/deprecated-list.html javadoc/1.2/help-doc.html javadoc/1.2/index-all.html javadoc/1.2/index.html javadoc/1.2/overview-frame.html javadoc/1.2/overview-summary.html javadoc/1.2/overview-tree.html javadoc/1.2/package-list javadoc/1.2/script.js javadoc/1.2/serialized-form.html javadoc/1.2/stylesheet.css /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + echo /usr/share/javadoc/boilerpipe + install -pm 644 dist/boilerpipe-demo-1.2.0.jar /usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying and fixing files in /usr/src/tmp/boilerpipe-buildroot (binconfig,pkgconfig,libtool,desktop) Checking contents of files in /usr/src/tmp/boilerpipe-buildroot/ (default) Compressing files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying ELF objects in /usr/src/tmp/boilerpipe-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) Hardlinking identical .pyc and .pyo files Processing files: boilerpipe-1.2.0-alt1_13jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.20206 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.0hYpj2 find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell) [INFO maven.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.prov] mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 [INFO osgi.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.1ZXGU2 find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks,systemd-services) [INFO maven.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.req] javapackages-filesystem, mvn(net.sourceforge.nekohtml:nekohtml) [INFO osgi.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Provides: mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 Requires: javapackages-filesystem, mvn(net.sourceforge.nekohtml:nekohtml) Processing files: boilerpipe-javadoc-1.2.0-alt1_13jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.96082 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.CH97w5 find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.cdskP5 find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks,systemd-services) Requires: javapackages-filesystem Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm 23.39user 1.58system 0:40.77elapsed 61%CPU (0avgtext+0avgdata 135244maxresident)k 0inputs+0outputs (0major+379550minor)pagefaults 0swaps /.out/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: ASL /.out/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: 2.0 /.out/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: ASL /.out/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: 2.0 48.96user 7.18system 1:15.54elapsed 74%CPU (0avgtext+0avgdata 135244maxresident)k 0inputs+0outputs (0major+1158514minor)pagefaults 0swaps --- boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm.repo 2019-05-26 22:26:46.000000000 +0000 +++ boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm.hasher 2020-07-25 10:38:54.799512479 +0000 @@ -7,3 +7,3 @@ /usr/share/maven-poms/boilerpipe.pom 100644 -Requires: javapackages-tools +Requires: javapackages-filesystem Requires: mvn(net.sourceforge.nekohtml:nekohtml) --- boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm.repo 2019-05-26 22:26:46.000000000 +0000 +++ boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm.hasher 2020-07-25 10:38:54.865512690 +0000 @@ -215,3 +215,3 @@ /usr/share/javadoc/boilerpipe/stylesheet.css 100644 -Requires: javapackages-tools +Requires: javapackages-filesystem Requires: rpmlib(PayloadIsLzma)