Dashboard > Heritrix > ... > 1.12.0 > Issues with 'Fix Version' 1.12.0
Heritrix Log In View a printable version of the current page.
Issues with 'Fix Version' 1.12.0
Added by Karl Thiessen, last edited by Gordon Mohr on Mar 12, 2007  (view change)
Labels: 
(None)

The following Heritrix issues have a 'fix version' of 1.12.0, meaning they are fixed or expected to be fixed for Heritrix release 1.12.0. (This list is dynamically updated from the JIRA Issue Tracking project for Heritrix.)

IA Webteam JIRA (24 issues)
Key Summary T Created Updated Assignee Reporter Pr Status Res
HER-659 filehandle leak: ReplayInputStream/BufferedSeekInputStream Bug Feb 16, 2007 Apr 25, 2007 Karl Thiessen Gordon Mohr Blocker ClosedClosed FIXED
HER-434 "failed get of replay" in ExtractorHTML... usu: UTF-16BE Bug Feb 16, 2007 Mar 22, 2007 Karl Thiessen Gordon Mohr Critical ClosedClosed FIXED
HER-4 robots.txt "crawl-delay" (and "allow") directive breaks parsing Bug Feb 13, 2007 Apr 25, 2007 Karl Thiessen (sourceforge) Critical ClosedClosed FIXED
HER-1080 CrawlURI.getContentDigest for DNS URIs returns digest of zero-length-input Bug Feb 20, 2007 Mar 08, 2007 Gordon Mohr Gordon Mohr Major ClosedClosed FIXED
HER-1086 TransclusionDecideRule should offer lower cap for speculative hops New Feature Mar 03, 2007 Mar 08, 2007 Gordon Mohr Gordon Mohr Major ResolvedResolved FIXED
HER-1090 Method to remove unwanted elements added by parent settings New Feature Mar 13, 2007 Mar 14, 2007 Karl Thiessen Michael Stack Major ResolvedResolved FIXED
HER-1091 v10 of WARCReader is unusable (ClassCastException) Bug Mar 13, 2007 Mar 14, 2007 Karl Thiessen Michael Stack Major ResolvedResolved FIXED
HER-1095 move from Filters to DecideRules is done, but still no replacement for ContentTypeRegExpFilter exists Bug Mar 14, 2007 Mar 19, 2007 Gordon Mohr Olaf Freyer Major ClosedClosed FIXED
HER-804 avoid double-extracting identical documents Improvement Feb 17, 2007 Mar 19, 2007 Karl Thiessen Gordon Mohr Major ClosedClosed FIXED
HER-1079 Carry forward prior-fetch information (content-digest, headers) useful for future recrawls New Feature Feb 20, 2007 Mar 19, 2007 Karl Thiessen Gordon Mohr Major ClosedClosed FIXED
HER-1081 Optionally use conditional-GET headers (If-Modified-Since, If-None-Match) in FetchHTTP if history info available New Feature Feb 20, 2007 Mar 19, 2007 Karl Thiessen Gordon Mohr Major ClosedClosed FIXED
HER-1083 Make extraction and writing dependent on duplicate analysis New Feature Feb 20, 2007 Mar 19, 2007 Karl Thiessen Gordon Mohr Major ResolvedResolved FIXED
HER-650 ExtractorHTML misses inline STYLE elements with comments Bug Feb 16, 2007 Mar 20, 2007 Karl Thiessen Vinay Goel Major ResolvedResolved FIXED
HER-1084 TestCases in SelfTest can't succeed in normal unit test suite, clutter results with spurious failures Bug Feb 21, 2007 Feb 28, 2007 Gordon Mohr Gordon Mohr Minor ResolvedResolved FIXED
HER-1075 crawl log digest field should include digest algorithm Improvement Feb 17, 2007 Mar 08, 2007 Gordon Mohr Michael Stack Minor ResolvedResolved FIXED
HER-1069 WUI: determine number of URL's matching regex in frontier Improvement Feb 17, 2007 Mar 09, 2007 Gordon Mohr (sourceforge) Minor ClosedClosed FIXED
HER-1071 [contrib] StripExtraSlashes canonicalization rule Improvement Feb 17, 2007 Mar 09, 2007 (sourceforge) Michael Stack Minor ClosedClosed FIXED
HER-1072 [contrib] Swedish libraries Kw3WriterProcessor Improvement Feb 17, 2007 Mar 09, 2007 (sourceforge) Michael Stack Minor ClosedClosed FIXED
HER-1074 Add digest using md5 option Improvement Feb 17, 2007 Mar 09, 2007 (sourceforge) Michael Stack Minor ClosedClosed FIXED
HER-1077 Replace per-Processor Filters with DecideRules Improvement Feb 18, 2007 Mar 09, 2007 Gordon Mohr Gordon Mohr Minor ClosedClosed FIXED
HER-1087 [arcreader] Handling of DELETED records Bug Mar 12, 2007 Mar 12, 2007 Karl Thiessen Michael Stack Minor ResolvedResolved FIXED
HER-1088 [arcreader] If ZipException, not-strict, and iterating, skip to next record Bug Mar 12, 2007 Mar 12, 2007 Karl Thiessen Michael Stack Minor ResolvedResolved FIXED
HER-1085 Httpclient failes to fetch URL with '|' in the path Bug Feb 26, 2007 Mar 20, 2007 Karl Thiessen Igor Ranitovic Minor ResolvedResolved FIXED
HER-1078 BDB-JE: use deferred writes more extensively, update to 3.2.13 Improvement Feb 20, 2007 Mar 20, 2007 Karl Thiessen Gordon Mohr Minor ResolvedResolved FIXED

Site powered by a free Open Source Project / Non-profit License (more) of Confluence - the Enterprise wiki.
Learn more or evaluate Confluence for your organisation.
Powered by Atlassian Confluence, the Enterprise Wiki. (Version: 2.2.10 Build:#528 Nov 29, 2006) - Bug/feature request - Contact Administrators