<86>Aug 14 04:18:56 userdel[4132221]: delete user 'rooter'
<86>Aug 14 04:18:56 userdel[4132221]: removed group 'rooter' owned by 'rooter'
<86>Aug 14 04:18:56 groupadd[4132230]: group added to /etc/group: name=rooter, GID=639
<86>Aug 14 04:18:56 groupadd[4132230]: group added to /etc/gshadow: name=rooter
<86>Aug 14 04:18:56 groupadd[4132230]: new group: name=rooter, GID=639
<86>Aug 14 04:18:56 useradd[4132240]: new user: name=rooter, UID=639, GID=639, home=/root, shell=/bin/bash
<86>Aug 14 04:18:56 userdel[4132250]: delete user 'builder'
<86>Aug 14 04:18:56 userdel[4132250]: removed group 'builder' owned by 'builder'
<86>Aug 14 04:18:56 userdel[4132250]: removed shadow group 'builder' owned by 'builder'
<86>Aug 14 04:18:56 groupadd[4132257]: group added to /etc/group: name=builder, GID=640
<86>Aug 14 04:18:56 groupadd[4132257]: group added to /etc/gshadow: name=builder
<86>Aug 14 04:18:56 groupadd[4132257]: new group: name=builder, GID=640
<86>Aug 14 04:18:56 useradd[4132261]: new user: name=builder, UID=640, GID=640, home=/usr/src, shell=/bin/bash
/usr/src/in/srpm/boilerpipe-1.2.0-alt1_13jpp8.src.rpm: license not found in '/usr/share/license' directory: ASL
/usr/src/in/srpm/boilerpipe-1.2.0-alt1_13jpp8.src.rpm: license not found in '/usr/share/license' directory: 2.0
<13>Aug 14 04:18:57 rpmi: rpm-macros-java-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed
<13>Aug 14 04:19:00 rpmi: javapackages-filesystem-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed
<13>Aug 14 04:19:00 rpmi: javapackages-tools-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed
<13>Aug 14 04:19:00 rpmi: libjpeg-2:2.0.2-alt1 sisyphus+226996.100.1.1 1554902884 installed
<13>Aug 14 04:19:00 rpmi: libpng16-1.6.37-alt1 sisyphus+236677.100.2.1 1566917982 installed
<13>Aug 14 04:19:00 rpmi: liblcms2-2.11-alt1 sisyphus+253499.100.1.1 1592286997 installed
<13>Aug 14 04:19:00 rpmi: libexpat-2.2.9-alt1 sisyphus+252464.200.2.1 1590958863 installed
<13>Aug 14 
04:19:00 rpmi: perl-HTTP-Date-6.04-alt1 sisyphus+241046.100.1.1 1574192946 installed <13>Aug 14 04:19:00 rpmi: libwayland-client-1.18.0-alt1 sisyphus+255795.100.1.1 1596475645 installed <13>Aug 14 04:19:00 rpmi: beust-jcommander-1.71-alt1_6jpp8 sisyphus+230680.100.1.3 1559093321 installed <13>Aug 14 04:19:00 rpmi: xmvn-api-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Aug 14 04:19:00 rpmi: xmvn-core-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Aug 14 04:19:00 rpmi: xml-commons-apis-1.4.01-alt3_29jpp8 sisyphus+246084.100.1.1 1581616535 installed <13>Aug 14 04:19:00 rpmi: liblksctp-1.0.17-alt2 1523113258 installed <13>Aug 14 04:19:00 rpmi: perl-XML-NamespaceSupport-1.12-alt1 1491296348 installed <13>Aug 14 04:19:00 rpmi: libsqlite3-3.32.3-alt1 sisyphus+253798.100.1.1 1592756027 installed <13>Aug 14 04:19:00 rpmi: libidn2-2.3.0-alt1 sisyphus+240846.100.1.2 1573870464 installed <13>Aug 14 04:19:00 rpmi: lksctp-tools-1.0.17-alt2 1523113258 installed <13>Aug 14 04:19:00 rpmi: java-common-1.6.0-alt1 sisyphus+234020.100.1.1 1562437039 installed <13>Aug 14 04:19:00 rpmi: xml-utils-1:2.9.10-alt3 sisyphus+245000.16400.79.1 1583229770 installed <13>Aug 14 04:19:00 rpmi: libpcsclite-1.9.0-alt1 sisyphus+253463.100.1.1 1592202073 installed <13>Aug 14 04:19:00 rpmi: javazi-2020a-alt1 sisyphus+250575.200.1.1 1587740494 installed <13>Aug 14 04:19:00 rpmi: libgif-4.1.6-alt3 1299634266 installed <13>Aug 14 04:19:00 rpmi: libusb-1.0.23-alt1 sisyphus+237317.100.1.1 1568059904 installed <13>Aug 14 04:19:00 rpmi: perl-LWP-MediaTypes-6.04-alt1 sisyphus+225468.100.1.1 1553186684 installed <13>Aug 14 04:19:00 rpmi: perl-Compress-Raw-Zlib-2.095-alt1 sisyphus+255277.100.1.1 1595512157 installed <13>Aug 14 04:19:00 rpmi: perl-libnet-1:3.11-alt1 1511423541 installed <13>Aug 14 04:19:00 rpmi: perl-XML-SAX-Base-1.09-alt1 1494364363 installed <13>Aug 14 04:19:00 rpmi: libfribidi-1.0.10-alt1 sisyphus+254557.100.1.1 1594020354 installed <13>Aug 14 04:19:00 
rpmi: libepoxy-1.5.4-alt1 sisyphus+242061.100.1.1 1575190153 installed <13>Aug 14 04:19:00 rpmi: libnettle8-3.6-alt1 sisyphus+251637.100.3.1 1590060224 installed <13>Aug 14 04:19:00 rpmi: libpaper-1.1.26-alt1 sisyphus+221360.100.1.1 1549974198 installed <13>Aug 14 04:19:00 rpmi: libopenjpeg2.0-2.3.1-alt1 sisyphus+226454.100.1.1 1554284336 installed <13>Aug 14 04:19:00 rpmi: libnspr-1:4.27-alt1 sisyphus+255566.100.1.1 1596128006 installed <13>Aug 14 04:19:00 rpmi: libglvnd-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124262 installed <13>Aug 14 04:19:00 rpmi: libwayland-server-1.18.0-alt1 sisyphus+255795.100.1.1 1596475645 installed <13>Aug 14 04:19:00 rpmi: libp11-kit-0.23.15-alt2 sisyphus+252784.100.2.2 1591274915 installed <13>Aug 14 04:19:00 rpmi: libtasn1-4.16.0-alt1 sisyphus+245480.100.1.1 1580825069 installed <13>Aug 14 04:19:00 rpmi: libICE-1.0.10-alt1 sisyphus+247690.100.1.1 1584000383 installed <13>Aug 14 04:19:00 rpmi: libSM-1.2.3-alt1 sisyphus+226734.100.2.1 1554586157 installed <13>Aug 14 04:19:00 rpmi: libhogweed6-3.6-alt1 sisyphus+251637.100.3.1 1590060224 installed <13>Aug 14 04:19:00 rpmi: libgnutls30-3.6.14-alt1 sisyphus+252951.100.1.1 1591437590 installed <13>Aug 14 04:19:00 rpmi: libwayland-cursor-1.18.0-alt1 sisyphus+255795.100.1.1 1596475645 installed <13>Aug 14 04:19:00 rpmi: libwayland-egl-4:18.1.0-alt1 sisyphus+255795.100.1.1 1596475645 installed <13>Aug 14 04:19:00 rpmi: perl-File-Listing-6.04-alt1 1329758996 installed <13>Aug 14 04:19:00 rpmi: libqpdf28-10.0.1-alt1 sisyphus+249852.100.1.1 1586771254 installed <13>Aug 14 04:19:00 rpmi: libjasper-2.0.16-alt1 sisyphus+231386.100.1.1 1559568071 installed <13>Aug 14 04:19:00 rpmi: apache-commons-compress-0:1.18-alt1_4jpp8 sisyphus+230079.100.1.3 1558794575 installed <13>Aug 14 04:19:00 rpmi: ant-lib-0:1.10.5-alt1_5jpp8 sisyphus+232747.100.2.1 1561092977 installed <13>Aug 14 04:19:00 rpmi: bcel-1:6.2-alt1_4jpp8 sisyphus+234754.100.1.2 1563384029 installed <13>Aug 14 04:19:00 rpmi: 
slf4j-0:1.7.25-alt1_6jpp8 sisyphus+234787.100.1.2 1563401783 installed <13>Aug 14 04:19:00 rpmi: zip-30000000:3.0-alt1 1332241778 installed <13>Aug 14 04:19:00 rpmi: sgml-common-0.6.3-alt15 1423664786 installed <13>Aug 14 04:19:00 rpmi: docbook-dtds-4.5-alt1 1223476557 installed <13>Aug 14 04:19:00 rpmi: docbook-style-xsl-1.79.1-alt4 sisyphus+232871.100.1.1 1561238010 installed <13>Aug 14 04:19:00 rpmi: libnatspec-0.3.1-alt2 1445691578 installed <13>Aug 14 04:19:00 rpmi: unzip-6.0-alt3 sisyphus+244330.100.1.1 1579094112 installed <13>Aug 14 04:19:00 rpmi: libgdbm-1.8.3-alt10 1454943313 installed <13>Aug 14 04:19:00 rpmi: printer-testpages-2.0-alt2 1148643941 installed <13>Aug 14 04:19:01 rpmi: libgtk+2-locales-2.24.32-alt4 sisyphus+255972.200.2.1 1596837957 installed <13>Aug 14 04:19:01 rpmi: icon-theme-hicolor-0.17-alt2 sisyphus+248343.100.1.1 1584979043 installed <13>Aug 14 04:19:01 rpmi: libxkbcommon-0.10.0-alt1 sisyphus+244530.100.1.1 1579516270 installed <13>Aug 14 04:19:01 rpmi: libgudev-1:233-alt1 sisyphus+235422.100.1.1 1564855269 installed <13>Aug 14 04:19:01 rpmi: udev-rules-1:246.1-alt1 sisyphus+256133.100.1.1 1597083454 installed <13>Aug 14 04:19:01 rpmi: perl-Try-Tiny-0.30-alt1 1514318058 installed <13>Aug 14 04:19:01 rpmi: perl-IO-Socket-IP-0.39-alt1 1494508514 installed <13>Aug 14 04:19:01 rpmi: perl-Compress-Raw-Bzip2-2.095-alt1 sisyphus+255276.100.1.1 1595511686 installed <13>Aug 14 04:19:01 rpmi: perl-HTML-Tagset-3.20-alt2 1317725093 installed <13>Aug 14 04:19:01 rpmi: perl-Term-ANSIColor-5.01-alt1 sisyphus+244783.100.1.2 1579747505 installed <13>Aug 14 04:19:01 rpmi: perl-Data-Dump-1.23-alt1 1444601978 installed <13>Aug 14 04:19:01 rpmi: perl-Filter-1.59-alt1.1 sisyphus+219907.400.1.1 1548343225 installed <13>Aug 14 04:19:01 rpmi: perl-Encode-3.04-alt1 sisyphus+247835.100.1.1 1584190284 installed <13>Aug 14 04:19:01 rpmi: perl-URI-1.76-alt1 sisyphus+220243.100.1.1 1548863244 installed <13>Aug 14 04:19:01 rpmi: perl-IO-Compress-2.093-alt1 
sisyphus+243543.100.1.1 1577294382 installed <13>Aug 14 04:19:01 rpmi: perl-Net-HTTP-6.19-alt1 sisyphus+229756.100.1.1 1558454558 installed <13>Aug 14 04:19:01 rpmi: perl-HTML-Parser-3.72-alt1.2 sisyphus+219907.600.1.1 1548343407 installed <13>Aug 14 04:19:01 rpmi: perl-WWW-RobotRules-6.02-alt1 1329756211 installed <13>Aug 14 04:19:01 rpmi: perl-Encode-Locale-1.05-alt1 1444608613 installed <13>Aug 14 04:19:01 rpmi: perl-IO-HTML-1.001-alt1 1404821752 installed <13>Aug 14 04:19:01 rpmi: perl-HTTP-Message-6.25-alt1 sisyphus+254521.100.1.1 1593894315 installed <13>Aug 14 04:19:01 rpmi: perl-HTTP-Cookies-6.08-alt1 sisyphus+242242.100.1.1 1575454022 installed <13>Aug 14 04:19:01 rpmi: perl-HTTP-Negotiate-6.01-alt1 1329760563 installed <13>Aug 14 04:19:01 rpmi: perl-libwww-6.46-alt1 sisyphus+254012.100.1.1 1593105927 installed <13>Aug 14 04:19:01 rpmi: perl-XML-LibXML-2.0202-alt1 sisyphus+246834.100.1.1 1582544045 installed <13>Aug 14 04:19:01 rpmi: perl-XML-SAX-1.02-alt1 sisyphus+232322.100.1.1 1560758406 installed <13>Aug 14 04:19:01 rpmi: perl-XML-Simple-2.25-alt1 1521437714 installed <13>Aug 14 04:19:01 rpmi: icon-naming-utils-0.8.90-alt1 1236573102 installed <13>Aug 14 04:19:02 rpmi: icon-theme-adwaita-3.36.1-alt1 sisyphus+250137.100.1.1 1587127395 installed <13>Aug 14 04:19:02 rpmi: libdatrie-0.2.9-alt1_6 1511686677 installed <13>Aug 14 04:19:02 rpmi: libthai-0.1.28-alt1_1 sisyphus+226107.100.1.1 1554123079 installed <13>Aug 14 04:19:02 rpmi: libgdk-pixbuf-locales-2.40.0-alt1 sisyphus+238952.140.2.1 1570644607 installed <13>Aug 14 04:19:02 rpmi: gtk+3-themes-incompatible-3.20-alt3 1461944560 installed <13>Aug 14 04:19:02 rpmi: libproxy-0.4.15-alt3.1 sisyphus+249308.100.1.1 1585930358 installed <13>Aug 14 04:19:02 rpmi: libwebp7-1.1.0-alt1 sisyphus+243895.100.1.1 1578410876 installed <13>Aug 14 04:19:02 rpmi: libjbig-2.1-alt1 1401380921 installed <13>Aug 14 04:19:02 rpmi: libtiff5-4.1.0-alt1 sisyphus+240802.100.1.1 1573743630 installed <13>Aug 14 04:19:02 rpmi: 
publicsuffix-list-dafsa-20200720-alt1 sisyphus+255208.100.1.1 1595349910 installed <13>Aug 14 04:19:02 rpmi: libpsl-0.21.1-alt1 sisyphus+255206.100.1.1 1595348931 installed <13>Aug 14 04:19:02 rpmi: libnghttp2-1.41.0-alt1 sisyphus+253680.100.1.1 1592642263 installed <13>Aug 14 04:19:02 rpmi: libverto-0.3.0-alt1_7 sisyphus+225932.100.1.1 1553994917 installed <13>Aug 14 04:19:02 rpmi: liblmdb-0.9.23-alt1 sisyphus+225277.100.2.1 1553001689 installed <13>Aug 14 04:19:02 rpmi: libkeyutils-1.6.1-alt1 sisyphus+256015.100.1.1 1596820121 installed <13>Aug 14 04:19:02 rpmi: libcom_err-1.44.6-alt1 sisyphus+224154.100.1.1 1552091653 installed <13>Aug 14 04:19:02 rpmi: poppler-data-0.4.9-alt1 sisyphus.216033.100 1541141723 installed <13>Aug 14 04:19:02 rpmi: libpixman-3:0.40.0-alt1 sisyphus+250700.100.1.1 1587970807 installed <13>Aug 14 04:19:02 rpmi: libbrotlicommon-1.0.7-alt1 sisyphus+226738.100.2.1 1554554565 installed <13>Aug 14 04:19:02 rpmi: libbrotlidec-1.0.7-alt1 sisyphus+226738.100.2.1 1554554565 installed <13>Aug 14 04:19:02 rpmi: libgraphite2-1.3.14-alt2 sisyphus+250009.100.1.1 1586943071 installed <13>Aug 14 04:19:02 rpmi: libharfbuzz-2.6.8-alt1 sisyphus+254028.100.1.1 1593106819 installed <13>Aug 14 04:19:02 rpmi: libfreetype-2.10.2-alt1 sisyphus+251736.100.1.1 1589531898 installed <13>Aug 14 04:19:02 rpmi: fontconfig-2.13.1-alt1 sisyphus+247349.100.1.2 1583841219 installed Updating fonts cache: <29>Aug 14 04:19:03 fontconfig: Updating fonts cache: succeeded [ DONE ] <13>Aug 14 04:19:03 rpmi: fonts-type1-xorg-7.0.0-alt4 1188553211 installed <13>Aug 14 04:19:03 rpmi: fonts-type1-urw-3:1.0.7pre44-alt3 sisyphus+224082.100.2.1 1552406640 installed <13>Aug 14 04:19:03 rpmi: libxshmfence-1.3-alt1 sisyphus+223149.1000.2.1 1551268594 installed <13>Aug 14 04:19:03 rpmi: libpciaccess-1:0.16-alt1 sisyphus+234814.100.1.1 1563438297 installed <13>Aug 14 04:19:03 rpmi: libdrm-1:2.4.102-alt1 sisyphus+252307.100.1.1 1590574831 installed <13>Aug 14 04:19:03 rpmi: 
libgbm-4:20.1.5-alt1 sisyphus+256154.300.1.1 1597137961 installed <13>Aug 14 04:19:03 rpmi: bc-1:1.07.1-alt1 sisyphus+221902.700.4.1 1550587848 installed <13>Aug 14 04:19:03 rpmi: libatk-locales-2.36.0-alt1 sisyphus+249208.100.1.1 1585840405 installed <13>Aug 14 04:19:03 rpmi: libatk-2.36.0-alt1 sisyphus+249208.100.1.1 1585840405 installed <13>Aug 14 04:19:04 rpmi: shared-mime-info-2.0-alt1 sisyphus+251302.100.1.1 1588847587 installed <13>Aug 14 04:19:04 rpmi: gsettings-desktop-schemas-data-3.36.1-alt1 sisyphus+250870.100.1.1 1588227108 installed <13>Aug 14 04:19:04 rpmi: libgio-2.64.4-alt1 sisyphus+254365.100.1.1 1593701005 installed <13>Aug 14 04:19:04 rpmi: gsettings-desktop-schemas-3.36.1-alt1 sisyphus+250870.100.1.1 1588227108 installed <13>Aug 14 04:19:04 rpmi: libgdk-pixbuf-2.40.0-alt1 sisyphus+238952.140.2.1 1570644607 installed <13>Aug 14 04:19:04 rpmi: gtk-update-icon-cache-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed <13>Aug 14 04:19:04 rpmi: libgusb-0.3.5-alt1 sisyphus+255577.100.1.1 1596150490 installed <13>Aug 14 04:19:04 rpmi: libcolord-1.4.4-alt2 sisyphus+229904.100.1.1 1558606512 installed <13>Aug 14 04:19:04 rpmi: libdconf-0.36.0-alt1 sisyphus+247780.1000.3.2 1584199651 installed <13>Aug 14 04:19:04 rpmi: libjson-glib-1.4.4-alt1 sisyphus.213175.100 1537249583 installed <13>Aug 14 04:19:04 rpmi: liblz4-1:1.9.2-alt1 sisyphus+238585.100.2.2 1570066861 installed <13>Aug 14 04:19:04 rpmi: libgpg-error-1.36-alt1 sisyphus+225621.300.1.1 1553521088 installed <13>Aug 14 04:19:04 rpmi: libgcrypt20-1.8.5-alt3 sisyphus+239622.100.1.1 1571746563 installed <13>Aug 14 04:19:04 rpmi: libsystemd-1:246.1-alt1 sisyphus+256133.100.1.1 1597083454 installed <13>Aug 14 04:19:04 rpmi: libdbus-1.12.18-alt1 sisyphus+252758.100.1.1 1591203684 installed <13>Aug 14 04:19:04 rpmi: libavahi-0.8-alt1 sisyphus+255349.240.4.1 1595604500 installed <13>Aug 14 04:19:04 rpmi: libcups-2.3.1-alt2 sisyphus+255816.100.2.1 1596533608 installed <13>Aug 14 04:19:05 rpmi: 
libgs-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568103940 installed
<13>Aug 14 04:19:05 rpmi: ghostscript-common-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568103940 installed
<13>Aug 14 04:19:05 rpmi: ghostscript-classic-9.28-alt0.rc1.1 sisyphus+237325.100.1.1 1568103940 installed
<13>Aug 14 04:19:05 rpmi: cups-filters-libs-1.27.5-alt1 sisyphus+253475.100.1.1 1592227583 installed
<13>Aug 14 04:19:05 rpmi: libavahi-glib-0.8-alt1 sisyphus+255349.240.4.1 1595604500 installed
<13>Aug 14 04:19:05 rpmi: dbus-tools-1.12.18-alt1 sisyphus+252758.100.1.1 1591203684 installed
<86>Aug 14 04:19:05 groupadd[4158117]: group added to /etc/group: name=messagebus, GID=499
<86>Aug 14 04:19:05 groupadd[4158117]: group added to /etc/gshadow: name=messagebus
<86>Aug 14 04:19:05 groupadd[4158117]: new group: name=messagebus, GID=499
<86>Aug 14 04:19:05 useradd[4158129]: new user: name=messagebus, UID=499, GID=499, home=/run/dbus, shell=/dev/null
<13>Aug 14 04:19:05 rpmi: dbus-1.12.18-alt1 sisyphus+252758.100.1.1 1591203684 installed
<13>Aug 14 04:19:05 rpmi: dconf-0.36.0-alt1 sisyphus+247780.1000.3.2 1584199651 installed
<13>Aug 14 04:19:05 rpmi: libgtk+3-schemas-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed
<13>Aug 14 04:19:05 rpmi: libpolkit-0.117-alt1 sisyphus+255723.100.1.1 1596386037 installed
<86>Aug 14 04:19:05 groupadd[4158226]: group added to /etc/group: name=colord, GID=498
<86>Aug 14 04:19:05 groupadd[4158226]: group added to /etc/gshadow: name=colord
<86>Aug 14 04:19:05 groupadd[4158226]: new group: name=colord, GID=498
<86>Aug 14 04:19:05 useradd[4158236]: new user: name=colord, UID=498, GID=498, home=/var/colord, shell=/dev/null
<13>Aug 14 04:19:05 rpmi: colord-1.4.4-alt2 sisyphus+229904.100.1.1 1558606512 installed
<13>Aug 14 04:19:05 rpmi: libxslt-1.1.34-alt2 sisyphus+248264.100.1.1 1584829787 installed
<13>Aug 14 04:19:05 rpmi: libX11-locales-3:1.6.11-alt1 sisyphus+256231.100.1.1 1597310848 installed
<13>Aug 14 04:19:06 rpmi: libXdmcp-1.1.3-alt1 
sisyphus+225206.600.1.2 1552949347 installed <13>Aug 14 04:19:06 rpmi: libXau-1.0.9-alt1 sisyphus+223149.200.2.1 1551268156 installed <13>Aug 14 04:19:06 rpmi: libxcb-1.14-alt1 sisyphus+247358.200.1.3 1583854223 installed <13>Aug 14 04:19:06 rpmi: libX11-3:1.6.11-alt1 sisyphus+256231.100.1.1 1597310848 installed <13>Aug 14 04:19:06 rpmi: libXext-1.3.4-alt1 sisyphus+225206.700.1.2 1552949422 installed <13>Aug 14 04:19:06 rpmi: libXrender-0.9.8-alt1 1371312110 installed <13>Aug 14 04:19:06 rpmi: libXi-1.7.10-alt1 sisyphus+232786.300.1.1 1561106975 installed <13>Aug 14 04:19:06 rpmi: libXfixes-5.0.3-alt1 sisyphus+226736.100.2.2 1554614842 installed <13>Aug 14 04:19:06 rpmi: libXtst-1.2.2-alt1 1369984880 installed <13>Aug 14 04:19:06 rpmi: libXdamage-1.1.5-alt1 sisyphus+225206.500.1.2 1552949282 installed <13>Aug 14 04:19:06 rpmi: libXcomposite-0.4.5-alt1 sisyphus+225206.300.1.2 1552949136 installed <13>Aug 14 04:19:06 rpmi: libXcursor-1.2.0-alt1 sisyphus+225206.400.1.2 1552949214 installed <13>Aug 14 04:19:06 rpmi: libXrandr-1.5.2-alt1 sisyphus+225206.1300.1.2 1552949698 installed <13>Aug 14 04:19:06 rpmi: libXinerama-1.1.4-alt1 sisyphus+223149.300.2.1 1551268223 installed <13>Aug 14 04:19:06 rpmi: libat-spi2-core-2.36.0-alt1 sisyphus+247780.1600.3.2 1584200247 installed <13>Aug 14 04:19:06 rpmi: libXft-2.3.3-alt1 sisyphus+225206.1000.3.2 1552987714 installed <13>Aug 14 04:19:06 rpmi: libXxf86vm-1.1.4-alt2 1527672159 installed <13>Aug 14 04:19:06 rpmi: libGLX-mesa-4:20.1.5-alt1 sisyphus+256154.300.1.1 1597137961 installed <13>Aug 14 04:19:06 rpmi: libEGL-mesa-4:20.1.5-alt1 sisyphus+256154.300.1.1 1597137961 installed <13>Aug 14 04:19:06 rpmi: libEGL-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124262 installed <13>Aug 14 04:19:06 rpmi: libGLX-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124262 installed <13>Aug 14 04:19:06 rpmi: libGL-7:1.3.2-alt1 sisyphus+254610.100.1.1 1594124262 installed <13>Aug 14 04:19:06 rpmi: libcairo-1:1.16.0-alt1 sisyphus+226534.100.2.3 1554515520 
installed <13>Aug 14 04:19:06 rpmi: libpango-1.45.5-alt1 sisyphus+255972.100.2.1 1596837790 installed <13>Aug 14 04:19:06 rpmi: libgtk+2-2.24.32-alt4 sisyphus+255972.200.2.1 1596837957 installed <13>Aug 14 04:19:06 rpmi: libgail-2.24.32-alt4 sisyphus+255972.200.2.1 1596837957 installed <13>Aug 14 04:19:06 rpmi: libcairo-gobject-1:1.16.0-alt1 sisyphus+226534.100.2.3 1554515520 installed <13>Aug 14 04:19:06 rpmi: dbus-tools-gui-1.12.18-alt1 sisyphus+252758.100.1.1 1591203684 installed <13>Aug 14 04:19:06 rpmi: at-spi2-core-2.36.0-alt1 sisyphus+247780.1600.3.2 1584200247 installed <13>Aug 14 04:19:06 rpmi: at-spi2-atk-2.34.2-alt1 sisyphus+247242.200.7.1 1583839985 installed <13>Aug 14 04:19:06 rpmi: rpm-macros-alternatives-0.5.1-alt1 sisyphus+226946.100.1.1 1554830426 installed <13>Aug 14 04:19:06 rpmi: alternatives-0.5.1-alt1 sisyphus+226946.100.1.1 1554830426 installed <13>Aug 14 04:19:06 rpmi: libnss-3.55.0-alt1 sisyphus+255566.200.1.1 1596128213 installed <13>Aug 14 04:19:06 rpmi: ca-certificates-2020.06.29-alt1 sisyphus+254237.300.1.1 1593450881 installed <13>Aug 14 04:19:06 rpmi: ca-trust-0.1.2-alt1 sisyphus+233348.100.1.1 1561653823 installed <13>Aug 14 04:19:06 rpmi: p11-kit-trust-0.23.15-alt2 sisyphus+252784.100.2.2 1591274915 installed <13>Aug 14 04:19:06 rpmi: libcrypto1.1-1.1.1g-alt1 sisyphus+249982.60.8.1 1587743567 installed <13>Aug 14 04:19:06 rpmi: libssl1.1-1.1.1g-alt1 sisyphus+249982.60.8.1 1587743567 installed <13>Aug 14 04:19:06 rpmi: libpython3-3.8.5-alt1 sisyphus+244405.100.3.1 1595544264 installed <13>Aug 14 04:19:06 rpmi: python3-3.8.5-alt1 sisyphus+244405.100.3.1 1595544264 installed <13>Aug 14 04:19:07 rpmi: python3-base-3.8.5-alt1 sisyphus+244405.100.3.1 1595544264 installed <86>Aug 14 04:19:07 groupadd[4161196]: group added to /etc/group: name=_keytab, GID=497 <86>Aug 14 04:19:07 groupadd[4161196]: group added to /etc/gshadow: name=_keytab <86>Aug 14 04:19:07 groupadd[4161196]: new group: name=_keytab, GID=497 <13>Aug 14 04:19:07 rpmi: 
libkrb5-1.18.2-alt2 sisyphus+254565.100.4.1 1594375563 installed <13>Aug 14 04:19:07 rpmi: python3-module-sugarbowl-0.52.1-alt1.git20141130.1.1 sisyphus+227470.1100.1.1 1555687657 installed <13>Aug 14 04:19:07 rpmi: python3-module-six-1.15.0-alt1 sisyphus+255738.100.2.1 1596527214 installed <13>Aug 14 04:19:07 rpmi: ca-trust-java-0.1.2-alt1 sisyphus+233348.100.1.1 1561653823 installed <13>Aug 14 04:19:09 rpmi: java-1.8.0-openjdk-headless-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+255828.100.2.1 1596585080 installed <86>Aug 14 04:19:10 groupadd[4165573]: group added to /etc/group: name=sasl, GID=496 <86>Aug 14 04:19:10 groupadd[4165573]: group added to /etc/gshadow: name=sasl <86>Aug 14 04:19:10 groupadd[4165573]: new group: name=sasl, GID=496 <13>Aug 14 04:19:10 rpmi: libsasl2-3-2.1.27-alt2.1 sisyphus+255909.100.2.1 1597199521 installed <13>Aug 14 04:19:10 rpmi: libldap-2.4.48-alt3 sisyphus+238816.100.1.1 1570449061 installed <13>Aug 14 04:19:10 rpmi: libcurl-7.71.1-alt1 sisyphus+254403.100.1.1 1593776497 installed <13>Aug 14 04:19:10 rpmi: libpoppler97-0.86.1-alt1 sisyphus+247631.100.1.1 1583927472 installed <13>Aug 14 04:19:10 rpmi: poppler-0.86.1-alt1 sisyphus+247631.100.1.1 1583927472 installed <13>Aug 14 04:19:10 rpmi: libpoppler0-cpp-0.86.1-alt1 sisyphus+247631.100.1.1 1583927472 installed <13>Aug 14 04:19:10 rpmi: cups-filters-1.27.5-alt1 sisyphus+253475.100.1.1 1592227583 installed <13>Aug 14 04:19:10 rpmi: cups-2.3.1-alt2 sisyphus+255816.100.2.1 1596533608 installed <13>Aug 14 04:19:13 rpmi: java-10-openjdk-headless-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562725639 installed <13>Aug 14 04:19:14 rpmi: python3-module-markupsafe-1.1.1-alt1 sisyphus+248369.100.1.1 1585046156 installed <13>Aug 14 04:19:14 rpmi: python3-module-jinja2-2.11.2-alt1 sisyphus+254573.100.1.1 1594043344 installed <13>Aug 14 04:19:14 rpmi: python3-module-clyde-0.8.0-alt1.git20141130.2.1 sisyphus+227465.1600.1.2 1555756906 installed <13>Aug 14 04:19:14 rpmi: 
python3-module-pkg_resources-1:46.1.3-alt1 sisyphus+250566.200.3.1 1587973342 installed <13>Aug 14 04:19:14 rpmi: python3-module-runfile-0.46.1-alt1.git20141130.2.1 sisyphus+227469.1300.2.3 1555706376 installed <13>Aug 14 04:19:14 rpmi: objectweb-asm-0:7.0-alt1_4jpp8 sisyphus+246362.100.1.3 1581801326 installed <13>Aug 14 04:19:14 rpmi: xmvn-install-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Aug 14 04:19:14 rpmi: xmvn-subst-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Aug 14 04:19:14 rpmi: xmvn-resolve-3.0.0-alt1_23jpp8 sisyphus+234592.200.1.1 1563216657 installed <13>Aug 14 04:19:14 rpmi: xml-commons-resolver-0:1.2-alt1_29jpp8 sisyphus+246085.100.1.1 1581616616 installed <13>Aug 14 04:19:14 rpmi: xalan-j2-0:2.7.1-alt4_39jpp8 sisyphus+230759.100.1.3 1559127607 installed <13>Aug 14 04:19:14 rpmi: xerces-j2-0:2.12.0-alt1_4jpp8 sisyphus+246082.100.1.1 1581615230 installed <13>Aug 14 04:19:14 rpmi: python3-module-genshi-0.7-alt2 sisyphus+229363.100.1.1 1557847321 installed <13>Aug 14 04:19:14 rpmi: python3-module-webencodings-0.5.1-alt2 sisyphus+245915.200.1.1 1581496105 installed <13>Aug 14 04:19:15 rpmi: python3-module-cssselect-0.9.1-alt2 sisyphus+250566.2300.6.1 1588188959 installed <13>Aug 14 04:19:15 rpmi: python3-module-html5lib-1:1.0.1-alt1 sisyphus+238807.100.2.1 1570465973 installed <13>Aug 14 04:19:15 rpmi: python3-module-lxml-4.5.0-alt2 sisyphus+250566.2700.6.1 1588189447 installed <13>Aug 14 04:19:15 rpmi: python3-module-javapackages-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Aug 14 04:19:15 rpmi: rpm-build-java-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed <13>Aug 14 04:19:15 rpmi: glib-networking-2.64.3-alt1 sisyphus+251581.1300.3.3 1590773456 installed <13>Aug 14 04:19:15 rpmi: libsoup-2.70.0-alt1 sisyphus+247780.1300.3.2 1584199886 installed <13>Aug 14 04:19:15 rpmi: libsoup-gnome-2.70.0-alt1 sisyphus+247780.1300.3.2 1584199886 installed <13>Aug 14 04:19:15 rpmi: 
librest-0.8.1-alt1 1508266396 installed
<13>Aug 14 04:19:15 rpmi: libgtk+3-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed
<13>Aug 14 04:19:15 rpmi: gtk3-demo-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed
<13>Aug 14 04:19:15 rpmi: libgail3-3.24.21-alt1 sisyphus+254255.100.1.1 1593514263 installed
<13>Aug 14 04:19:15 rpmi: java-stub-javadoc-0.1-alt1 1229813340 installed
<13>Aug 14 04:19:15 rpmi: alsa-ucm-conf-1.2.3-alt1 sisyphus+253139.200.1.1 1591812001 installed
<13>Aug 14 04:19:15 rpmi: alsa-topology-conf-1.2.3-alt1 sisyphus+253139.100.1.1 1591811985 installed
<13>Aug 14 04:19:15 rpmi: libalsa-1:1.2.3.2-alt1 sisyphus+254690.100.1.1 1594280085 installed
<13>Aug 14 04:19:16 rpmi: java-1.8.0-openjdk-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+255828.100.2.1 1596585080 installed
<13>Aug 14 04:19:16 rpmi: java-1.8.0-openjdk-devel-0:1.8.0.212.b04-alt2_0jpp8 sisyphus+255828.100.2.1 1596585080 installed
<13>Aug 14 04:19:16 rpmi: java-10-openjdk-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562725639 installed
<13>Aug 14 04:19:17 rpmi: java-10-openjdk-devel-0:10.0.2.13-alt1_7jpp9 sisyphus+234186.100.1.2 1562725639 installed
<13>Aug 14 04:19:17 rpmi: jpackage-generic-compat-0.30-alt1 sisyphus+234288.100.1.1 1562847521 installed
<13>Aug 14 04:19:17 rpmi: javapackages-local-1:5.3.0-alt1_4jpp8 sisyphus+234467.100.1.1 1563037789 installed
<13>Aug 14 04:19:17 rpmi: nekohtml-0:1.9.22-alt1_11jpp8 sisyphus+246358.100.1.1 1581799490 installed
<13>Aug 14 04:19:17 rpmi: ant-0:1.10.5-alt1_5jpp8 sisyphus+232747.100.2.1 1561092977 installed
Building target platforms: x86_64
Building for target x86_64
Wrote: /usr/src/in/nosrpm/boilerpipe-1.2.0-alt1_13jpp8.nosrc.rpm
Installing boilerpipe-1.2.0-alt1_13jpp8.src.rpm
Building target platforms: x86_64
Building for target x86_64
Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.42061
+ umask 022
+ /bin/mkdir -p /usr/src/RPM/BUILD
+ cd /usr/src/RPM/BUILD
+ cd /usr/src/RPM/BUILD
+ rm -rf boilerpipe-1.2.0
+ echo 'Source #0 (boilerpipe-1.2.0-src.tar.gz):'
Source #0 (boilerpipe-1.2.0-src.tar.gz):
+ /bin/gzip -dc /usr/src/RPM/SOURCES/boilerpipe-1.2.0-src.tar.gz
+ /bin/tar -xf -
+ cd boilerpipe-1.2.0
+ /bin/chmod -c -Rf u+rwX,go-w .
+ find . -iname '*.jar' -delete
+ find . -iname '*.class' -delete
+ echo 'Patch #0 (boilerpipe-1.2.0-libdir-patch):'
Patch #0 (boilerpipe-1.2.0-libdir-patch):
+ /usr/bin/patch -p0
patching file build.xml
+ cp /usr/src/RPM/SOURCES/boilerpipe-1.2.0.pom pom.xml
+ echo 'Patch #1 (boilerpipe-1.2.0-nekohtml-patch):'
Patch #1 (boilerpipe-1.2.0-nekohtml-patch):
+ /usr/bin/patch -p1
patching file pom.xml
patching file src/main/org/cyberneko/html/HTMLElements.java
patching file src/main/org/cyberneko/html/HTMLTagBalancer.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java 
src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java 
src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java 
src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java 
src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java
src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java 
src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java
src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java 
src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java
src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java
src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java
src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextBlock.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java
src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java
src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/document/TextDocument.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java
src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java
+ native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java
+ for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java
src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java 
src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java 
src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java 
src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java 
src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java 
src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java 
src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java 
src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java 
src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java + for s in 
src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java 
src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java 
src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java 
src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding 
UTF8 src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java 
src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java 
src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java 
src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java 
src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java 
src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java 
src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java 
src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java 
src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java 
src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java 
src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java 
src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java 
src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java 
src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java 
src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java 
src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java 
src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java 
src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java 
src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java 
src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java 
src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java 
src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java 
src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java 
src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java 
src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java 
src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java 
src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java 
src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java 
src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java 
src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java 
src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java 
src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java 
src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java 
src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java 
src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in 
src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java 
src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java 
src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding 
UTF8 src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java 
src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java 
src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java 
src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java 
src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java 
src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java 
src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java 
src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java 
src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java 
src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java 
src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java 
src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java 
src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java 
src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java 
src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java 
src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java 
src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java 
src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java 
src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java 
src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java 
src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java 
src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java 
src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java 
src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java 
src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java 
src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java 
src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java 
src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java 
src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/TagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java 
src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java + for s in src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeInput.java src/main/de/l3s/boilerpipe/BoilerpipeFilter.java src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java src/main/de/l3s/boilerpipe/BoilerpipeProcessingException.java src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java src/main/de/l3s/boilerpipe/document/TextBlock.java src/main/de/l3s/boilerpipe/document/TextDocumentStatistics.java src/main/de/l3s/boilerpipe/document/TextDocument.java src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java 
src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingWithMinKWordsExtractor.java src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java src/main/de/l3s/boilerpipe/extractors/CommonExtractors.java src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java src/main/de/l3s/boilerpipe/extractors/KeepEverythingExtractor.java src/main/de/l3s/boilerpipe/filters/english/HeuristicFilterBase.java src/main/de/l3s/boilerpipe/filters/english/KeepLargestFulltextBlockFilter.java src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java src/main/de/l3s/boilerpipe/filters/english/MinFulltextWordsFilter.java src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/KeepLargestBlockFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java src/main/de/l3s/boilerpipe/filters/heuristics/ContentFusion.java src/main/de/l3s/boilerpipe/filters/simple/MinWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/LabelToBoilerplateFilter.java 
src/main/de/l3s/boilerpipe/filters/simple/LabelToContentFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/InvertedFilter.java src/main/de/l3s/boilerpipe/filters/simple/MinClauseWordsFilter.java src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java src/main/de/l3s/boilerpipe/filters/simple/MarkEverythingContentFilter.java src/main/de/l3s/boilerpipe/labels/DefaultLabels.java src/main/de/l3s/boilerpipe/labels/ConditionalLabelAction.java src/main/de/l3s/boilerpipe/labels/LabelAction.java src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java src/main/de/l3s/boilerpipe/sax/TagActionMap.java src/main/de/l3s/boilerpipe/sax/InputSourceable.java src/main/de/l3s/boilerpipe/sax/HTMLDocument.java src/main/de/l3s/boilerpipe/sax/CommonTagActions.java src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java src/main/de/l3s/boilerpipe/sax/TagAction.java src/main/de/l3s/boilerpipe/sax/MarkupTagAction.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + native2ascii -encoding UTF8 src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java src/main/de/l3s/boilerpipe/util/UnicodeTokenizer.java + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.76442 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + ant -Dapp.javaversion=1.6 Buildfile: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml clean: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2 init: [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo 
[mkdir] Created dir: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist javadoc: [javadoc] Generating Javadoc [javadoc] Javadoc execution [javadoc] Loading source files for package de.l3s.boilerpipe... [javadoc] Loading source files for package de.l3s.boilerpipe.conditions... [javadoc] Loading source files for package de.l3s.boilerpipe.document... [javadoc] Loading source files for package de.l3s.boilerpipe.estimators... [javadoc] Loading source files for package de.l3s.boilerpipe.extractors... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.english... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.heuristics... [javadoc] Loading source files for package de.l3s.boilerpipe.filters.simple... [javadoc] Loading source files for package de.l3s.boilerpipe.labels... [javadoc] Loading source files for package de.l3s.boilerpipe.sax... [javadoc] Loading source files for package de.l3s.boilerpipe.util... [javadoc] Constructing Javadoc information... [javadoc] Standard Doclet version 1.8.0_212 [javadoc] Building tree for all the packages and classes... 
[javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:21: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:33: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:44: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeExtractor.java:54: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeFilter.java:36: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/BoilerpipeInput.java:32: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:33: warning: no description for @param [javadoc] * @param tb [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/conditions/TextBlockCondition.java:34: error: malformed HTML [javadoc] * @return iff the condition is met. 
[javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextBlock.java:252: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/document/TextDocument.java:78: warning: no description for @param [javadoc] * @param title [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:46: warning: no description for @param [javadoc] * @param dsBefore [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/estimators/SimpleEstimator.java:47: warning: no description for @param [javadoc] * @param dsAfter [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleExtractor.java:43: warning: no @return [javadoc] public static ArticleExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:47: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:64: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:83: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:98: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ExtractorBase.java:109: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] 
/usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/ArticleSentencesExtractor.java:36: warning: no @return [javadoc] public static ArticleSentencesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/CanolaExtractor.java:43: warning: no @return [javadoc] public static CanolaExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/DefaultExtractor.java:37: warning: no @return [javadoc] public static DefaultExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/LargestContentExtractor.java:42: warning: no @return [javadoc] public static LargestContentExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/extractors/NumWordsRulesExtractor.java:36: warning: no @return [javadoc] public static NumWordsRulesExtractor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/DensityRulesClassifier.java:43: warning: no @return [javadoc] public static DensityRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFilter.java:47: warning: no @return [javadoc] public static IgnoreBlocksAfterContentFilter getDefaultInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/NumWordsRulesClassifier.java:42: warning: no @return [javadoc] public static NumWordsRulesClassifier getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/TerminatingBlocksFinder.java:40: warning: no @return [javadoc] public static TerminatingBlocksFinder getInstance() { [javadoc] ^ [javadoc] 
/usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:44: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:45: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/AddPrecedingLabelsFilter.java:47: warning: no @param for labelPrefix [javadoc] public AddPrecedingLabelsFilter(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:55: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/BlockProximityFusion.java:57: warning: no @param for sameTagLevelOnly [javadoc] public BlockProximityFusion(final int maxBlocksDistance, [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/ExpandTitleToContentFilter.java:40: warning: no @return [javadoc] public static ExpandTitleToContentFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:45: error: @param name not found [javadoc] * @param maxBlocksDistance The maximum distance in blocks. 
[javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: error: @param name not found [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:46: warning: no description for @param [javadoc] * @param contentOnly [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/LabelFusion.java:48: warning: no @param for labelPrefix [javadoc] public LabelFusion(final String labelPrefix) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/heuristics/SimpleBlockFusionProcessor.java:39: warning: no @return [javadoc] public static SimpleBlockFusionProcessor getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/BoilerplateBlockFilter.java:39: warning: no @return [javadoc] public static BoilerplateBlockFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/simple/SplitParagraphBlocksFilter.java:45: warning: no @return [javadoc] public static SplitParagraphBlocksFilter getInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLParser.java:47: warning: no description for @param [javadoc] * @param contentHandler [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:59: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:40: warning: no description for @param [javadoc] * @param is [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/BoilerpipeSAXInput.java:41: warning: no description for @throws [javadoc] * @throws 
SAXException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:27: warning: no description for @param [javadoc] * @param url [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:28: warning: no description for @return [javadoc] * @return [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLFetcher.java:29: warning: no description for @throws [javadoc] * @throws IOException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:54: warning: no @return [javadoc] public static HTMLHighlighter newHighlightingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:62: warning: no @return [javadoc] public static HTMLHighlighter newExtractingInstance() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:88: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:90: warning: no @return [javadoc] public String process(final TextDocument doc, final String origHTML) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:103: warning: no description for @throws [javadoc] * @throws BoilerpipeProcessingException [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:105: warning: no @return [javadoc] public String process(final TextDocument doc, final InputSource is) [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:162: warning: no @return [javadoc] public boolean isOutputHighlightOnly() { [javadoc] ^ [javadoc] 
/usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:170: warning: no @param for outputHighlightOnly [javadoc] public void setOutputHighlightOnly(boolean outputHighlightOnly) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:181: warning: no @return [javadoc] public String getExtraStyleSheet() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:203: error: invalid entity &qupt; [javadoc] * <span class=&qupt;x-boilerpipe-mark1"> [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:205: warning: no @return [javadoc] public String getPreHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:215: warning: no @param for preHighlight [javadoc] public void setPreHighlight(String preHighlight) { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:225: warning: no @return [javadoc] public String getPostHighlight() { [javadoc] ^ [javadoc] /usr/src/RPM/BUILD/boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/sax/HTMLHighlighter.java:234: warning: no @param for postHighlight [javadoc] public void setPostHighlight(String postHighlight) { [javadoc] ^ [javadoc] Building index for all the packages and classes... [javadoc] Building index for all classes... [javadoc] Generating /usr/src/RPM/BUILD/boilerpipe-1.2.0/javadoc/1.2/help-doc.html... 
[javadoc] 6 errors [javadoc] 56 warnings compile: [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:93: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 62 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/main [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning [javac] /usr/src/RPM/BUILD/boilerpipe-1.2.0/build.xml:94: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 3 source files to /usr/src/RPM/BUILD/boilerpipe-1.2.0/build/demo [javac] warning: [options] bootstrap class path not set in conjunction with -source 1.6 [javac] 1 warning jars: [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-demo-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-javadoc-1.2.0.jar [jar] Building jar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-sources-1.2.0.jar dist: [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-bin.tar.gz [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/extractors/class-use/KeepEverythingWithMinKWordsExtractor.html longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/DensityRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFilter.html longer than 100 characters. 
[tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/IgnoreBlocksAfterContentFromEndFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/KeepLargestFulltextBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/MinFulltextWordsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/NumWordsRulesClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/english/class-use/TerminatingBlocksFinder.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/AddPrecedingLabelsFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ArticleMetadataFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/BlockProximityFusion.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/DocumentTitleMatchClassifier.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/ExpandTitleToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/KeepLargestBlockFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/heuristics/class-use/SimpleBlockFusionProcessor.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/LabelToBoilerplateFilter.html longer than 100 characters. 
[tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/MarkEverythingContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SplitParagraphBlocksFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/filters/simple/class-use/SurroundingToContentFilter.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.BlockTagLabelAction.html longer than 100 characters. [tar] Entry: boilerpipe-1.2.0/javadoc/1.2/de/l3s/boilerpipe/sax/class-use/CommonTagActions.InlineTagLabelAction.html longer than 100 characters. [tar] Building tar: /usr/src/RPM/BUILD/boilerpipe-1.2.0/dist/boilerpipe-1.2.0-src.tar.gz [tar] Entry: boilerpipe-1.2.0/src/main/de/l3s/boilerpipe/filters/english/IgnoreBlocksAfterContentFromEndFilter.java longer than 100 characters. [tar] Resulting tar file can only be processed successfully by GNU compatible tar commands BUILD SUCCESSFUL Total time: 3 seconds + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.8894 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/boilerpipe-buildroot + : + /bin/rm -rf -- /usr/src/tmp/boilerpipe-buildroot + cd boilerpipe-1.2.0 + /usr/bin/python3 /usr/share/java-utils/mvn_artifact.py pom.xml dist/boilerpipe-1.2.0.jar + /usr/bin/python3 /usr/share/java-utils/mvn_file.py de.l3s.boilerpipe:boilerpipe boilerpipe + xmvn-install -R .xmvn-reactor -n boilerpipe -d /usr/src/tmp/boilerpipe-buildroot [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:pom:1.2.0 [INFO] Installing artifact de.l3s.boilerpipe:boilerpipe:jar:1.2.0 [INFO] Installation successful + jdir=javadoc/1.2 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/license + '[' -d javadoc/1.2 ']' + install -dm755 /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + cp -pr 
javadoc/1.2/allclasses-frame.html javadoc/1.2/allclasses-noframe.html javadoc/1.2/constant-values.html javadoc/1.2/de javadoc/1.2/deprecated-list.html javadoc/1.2/help-doc.html javadoc/1.2/index-all.html javadoc/1.2/index.html javadoc/1.2/overview-frame.html javadoc/1.2/overview-summary.html javadoc/1.2/overview-tree.html javadoc/1.2/package-list javadoc/1.2/script.js javadoc/1.2/serialized-form.html javadoc/1.2/stylesheet.css /usr/src/tmp/boilerpipe-buildroot/usr/share/javadoc/boilerpipe + echo /usr/share/javadoc/boilerpipe + install -pm 644 dist/boilerpipe-demo-1.2.0.jar /usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying and fixing files in /usr/src/tmp/boilerpipe-buildroot (binconfig,pkgconfig,libtool,desktop) Checking contents of files in /usr/src/tmp/boilerpipe-buildroot/ (default) Compressing files in /usr/src/tmp/boilerpipe-buildroot (auto) Verifying ELF objects in /usr/src/tmp/boilerpipe-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) Hardlinking identical .pyc and .pyo files Processing files: boilerpipe-1.2.0-alt1_13jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.82329 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + chmod -R a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-1.2.0 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e 
/usr/src/tmp/rpm-tmp.mfsZPs find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell) [INFO maven.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.prov] mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0 [INFO osgi.prov] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.ehmd5t find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks,systemd-services) [INFO maven.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/maven-metadata/boilerpipe.xml'] [INFO maven.req] javapackages-filesystem, mvn(net.sourceforge.nekohtml:nekohtml) [INFO osgi.req] input: ['/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe-demo.jar', '/usr/src/tmp/boilerpipe-buildroot/usr/share/java/boilerpipe.jar'] Provides: mvn(de.l3s.boilerpipe:boilerpipe) = 1.2.0, mvn(de.l3s.boilerpipe:boilerpipe:pom:) = 1.2.0 Requires: javapackages-filesystem, mvn(net.sourceforge.nekohtml:nekohtml) Processing files: boilerpipe-javadoc-1.2.0-alt1_13jpp8 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.462 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd boilerpipe-1.2.0 + DOCDIR=/usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + export DOCDIR + rm -rf /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + /bin/mkdir -p /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + cp -prL --no-dereference LICENSE.txt NOTICE.txt /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R go-w /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0 + chmod -R 
a+rX /usr/src/tmp/boilerpipe-buildroot/usr/share/doc/boilerpipe-javadoc-1.2.0
+ exit 0
Finding Provides (using /usr/lib/rpm/find-provides)
Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.xbTTfr
find-provides: running scripts (alternatives,debuginfo,lib,maven,osgi-fc,pam,perl,pkgconfig,python,shell)
Finding Requires (using /usr/lib/rpm/find-requires)
Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.RrIblq
find-requires: running scripts (cpp,debuginfo,files,javadoc,lib,maven,osgi-fc,pam,perl,pkgconfig,pkgconfiglib,python,rpmlib,shebang,shell,static,symlinks,systemd-services)
Requires: javapackages-filesystem
Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm
Wrote: /usr/src/RPM/RPMS/noarch/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm
19.93user 1.63system 0:15.72elapsed 137%CPU (0avgtext+0avgdata 277804maxresident)k
0inputs+0outputs (0major+384865minor)pagefaults 0swaps
/.out/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: ASL
/.out/boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: 2.0
/.out/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: ASL
/.out/boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm: license not found in '/usr/share/license' directory: 2.0
41.70user 6.30system 0:43.36elapsed 110%CPU (0avgtext+0avgdata 277804maxresident)k
0inputs+0outputs (0major+1153668minor)pagefaults 0swaps
--- boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm.repo 2019-05-26 22:26:46.000000000 +0000
+++ boilerpipe-1.2.0-alt1_13jpp8.noarch.rpm.hasher 2020-08-14 04:19:37.718494478 +0000
@@ -7,3 +7,3 @@
 /usr/share/maven-poms/boilerpipe.pom 100644
-Requires: javapackages-tools
+Requires: javapackages-filesystem
 Requires: mvn(net.sourceforge.nekohtml:nekohtml)
--- boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm.repo 2019-05-26 22:26:46.000000000 +0000
+++ boilerpipe-javadoc-1.2.0-alt1_13jpp8.noarch.rpm.hasher 2020-08-14 04:19:37.737494541 +0000
@@ -215,3 +215,3 @@
 /usr/share/javadoc/boilerpipe/stylesheet.css 100644
-Requires: javapackages-tools
+Requires: javapackages-filesystem
 Requires: rpmlib(PayloadIsLzma)