Changelog History

  • v0.8.8 Changes

    August 04, 2013
    • JavaScript Tracker: moved into own repo (#277)
    • Hadoop ETL: bumped to 0.3.3
    • Hadoop ETL: URL-decodes "%3D" to "=" to allow Hive-style directory names as arguments (#305)
    • Hadoop ETL: bumped referer-parser to 0.1.1 to fix java.lang.NullPointerException (#314)
    • EmrEtlRunner: bumped to 0.4.0
    • EmrEtlRunner: bumped Sluice to 0.0.7 (#299)
    • EmrEtlRunner: removed :snowplow: section from config.yml.sample (#289)
    • EmrEtlRunner: simplified EmrEtlRunner and its config (#287)
    • EmrEtlRunner: added run= to timestamped ETL folder names (#294)
    • EmrEtlRunner: updated "Jobflow started" stdout message to include jobflow ID (#315)
    • Hive ETL: removed folder 3-enrich/hive-etl as no longer supported (#286)
    • Hive storage: updated hive-storage scripts to work with current Redshift-format flatfile (#290)
    • Infobright: removed folder 4-storage/infobright as not currently supported (#285)
    • Postgres: add Postgres table definition in atomic schema (#160)
    • StorageLoader: bumped to 0.1.0
    • StorageLoader: bumped Sluice to 0.0.7 (#300)
    • StorageLoader: removed code to delete Hive ETL's empty event files (#306)
    • StorageLoader: fixed bug where download path has to be set (even when using Redshift) (#280)
    • StorageLoader: optimized ANALYZE and VACUUM commands (#283)
    • StorageLoader: added MAXERROR as StorageLoader configuration value for Redshift (#273)
    • StorageLoader: added support for loading Postgres (#161)
    • StorageLoader: removed Infobright loading capability (#307)
    • StorageLoader: added support for loading into multiple storage targets (#311)
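    The Hadoop ETL entry for #305 above mentions decoding "%3D" to "=" so that Hive-style directory names (such as the run= folders from #294) can be passed as arguments. A minimal Python sketch of that idea is shown here. It is not the project's code, and the function name and the bucket path are hypothetical.

    ```python
    def decode_equals(arg: str) -> str:
        """Replace the URL-encoded form of "=" ("%3D") with a literal "=".

        Assumption: only the exact sequence "%3D" is decoded, as the changelog
        entry describes; other percent-encoded sequences are left untouched.
        """
        return arg.replace("%3D", "=")


    # Hypothetical argument pointing at a Hive-style (key=value) folder.
    encoded = "s3://my-bucket/events/run%3D2013-08-04-00-00-00/"
    decoded = decode_equals(encoded)
    print(decoded)
    ```

    A general-purpose decoder such as `urllib.parse.unquote` would also decode every other percent-escape in the argument, which may or may not be wanted; the sketch stays with the single substitution named in the entry.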
  • v0.8.7 Changes

    July 07, 2013
    • JavaScript Tracker: bumped to 0.12.0
    • JavaScript Tracker: fixed document reference to use documentAlias (#247)
    • JavaScript Tracker: fixed bug with setCustomUrl (#267)
    • JavaScript Tracker: changed ev_ to se_ for structured events (#197)
    • JavaScript Tracker: fixed Firefox failure when "Always ask" set for cookies (#163)
    • JavaScript Tracker: fixed bug in page ping functionality detected in IE 8 (#260)
    • JavaScript Tracker: replaced forEach as not supported in IE 6-8 (#295)
    • EmrEtlRunner: fixed bug in config.yml.sample (#291)
    • Arduino tracker: added git submodule link (#292)
  • v0.8.6 Changes

    June 03, 2013
    • Hadoop ETL: bumped to 0.3.2
    • Hadoop ETL: bumped Scalding to 0.8.5
    • Hadoop ETL: bumped Scala version to 2.10.0
    • Hadoop ETL: bumped scala-maxmind-geoip to 0.0.5 to work with Scala 2.10.0
    • Hadoop ETL: bumped SBT from 0.12.1 to 0.12.3
    • Hadoop ETL: bumped Specs2 to 1.14
    • Hadoop ETL: replaced Bytes in CanonicalOutput with JBytes (#254)
    • Hadoop ETL: disabled the "corruption" detection in the ETL that was overriding custom URLs with longer collector referer URLs (#268)
    • EmrEtlRunner: bumped to 0.3.0
    • EmrEtlRunner: updated config.yml.sample to support spot task instances
    • EmrEtlRunner: let EmrEtlRunner use spot task instances (#193)
    • EmrEtlRunner: consolidate small files prior to running ETL job (#207)
  • v0.8.5 Changes

    May 24, 2013
    • Hadoop ETL: bumped to 0.3.1
    • Hadoop ETL: now supports downloading GeoLiteCity.dat from public S3 URL if needed, thanks @petervanwesep (part of #258)
    • Hadoop ETL: added Twitter Maven Repo as a resolution repo, thanks @rgabo (#239)
    • Hadoop ETL: stripping control characters in addition to tabs and newlines (#259)
    • Hadoop ETL: fixed issue with large values for se_value (#263)
    • Hadoop ETL: renamed ev_ fields in CanonicalOutput to se_
    • Hadoop ETL: extractResolution renamed and fails gracefully if view dimensions exceed Integer max size (#264)
    • EmrEtlRunner: bumped to 0.2.1
    • EmrEtlRunner: returns public S3 URL to GeoLiteCity.dat file if hosted by Snowplow, thanks @petervanwesep (part of #258)
    • Redshift: table-def script bumped to 0.2.1
    • Redshift: migration script added for 0.2.0 to 0.2.1
    • Redshift: bumped se_value from a float to a double
    • Redshift: increased size of _urlport fields, thanks @petervanwesep (#266)
    • Infobright: bumped setup_ and verify_infobright.sql to 0.0.9
    • Infobright: added migration script 0.0.8->0.0.9
    • Infobright: increased size of _urlport fields, thanks @petervanwesep (#266)
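    The Hadoop ETL entry for #259 above refers to stripping control characters in addition to tabs and newlines, presumably so that field values cannot break a tab-separated, line-oriented output file. The Python sketch below illustrates one way to do this. It assumes the characters are removed outright and that "control characters" means the ASCII range 0x00-0x1F plus 0x7F; the changelog confirms neither detail, and the function name is hypothetical.

    ```python
    import re

    # ASCII control characters: 0x00-0x1F (includes tab, LF, CR) and 0x7F (DEL).
    _CONTROL_CHARS = re.compile(r"[\x00-\x1f\x7f]")


    def strip_control_chars(value: str) -> str:
        """Remove ASCII control characters so the value is safe for a TSV field."""
        return _CONTROL_CHARS.sub("", value)


    cleaned = strip_control_chars("page\ttitle\nwith\x00junk")
    print(cleaned)
    ```

    Replacing each control character with a space instead of deleting it would preserve word boundaries; which behaviour the ETL uses is not stated in the entry.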
  • v0.8.4 Changes

    May 16, 2013
    • Hadoop ETL: bumped to 0.3.0
    • Hadoop ETL: added geo-ip lookup to Scalding ETL
    • Hadoop ETL: bumped referer-parser from 0.1.0-M6 to 0.1.0
    • Hadoop ETL: removed truncation of page_referrer (#236)
    • Hadoop ETL: added truncation of referer path/qs/fragment (#235)
    • Hadoop ETL: removing tabs found in referer search terms (#234)
    • Hadoop ETL: fixed client timestamp so it's not incorrectly localised - thanks @rgabo (#238)
    • Hadoop ETL: added parsing of collector version cv (#243)
    • Hadoop ETL: bumped Scalaz from 7.0.0-M9 to 7.0.0
    • Hadoop ETL: removed .gets from extractPageUri (#249)
    • EmrEtlRunner: bumped to 0.2.0
    • EmrEtlRunner: now passes MaxMind .dat file into Scalding ETL (#213)
    • EmrEtlRunner: improve messages when ETL job starts and fails (#230)
    • Redshift: table-def script bumped to 0.2.0
    • Redshift: migration script added for 0.1.0 to 0.2.0
    • Redshift: added geo-ip fields to Redshift table definition (#226)
    • Redshift: rename ev_ fields to se_ for structured events (#227)
  • v0.8.3 Changes

    May 14, 2013
    • JavaScript Tracker: bumped to 0.11.2
    • JavaScript Tracker: added unstructured events, thanks @rgabo, @tarsolya, @lackac (#198)
    • JavaScript Tracker: remove leading ampersand in querystring (#188)
    • Clojure Collector: bumped to 0.5.0
    • Clojure Collector: upgraded to use Tomcat AccessLogValve 0.0.4 (#240)
    • Clojure Collector: now logging Clojure Collector and Tomcat AccessLogValve versions (#239)
    • Common: completed splitting custom event type into: unstructured and structured events (#133)
  • v0.8.2 Changes

    May 08, 2013
    • Clojure Collector: bumped to 0.4.0
    • Clojure Collector: remove duplicate of wrap-request-logging in middleware.clj (#221)
    • Clojure Collector: check/potentially bump lein-ring dependency in project.clj (#222)
    • Clojure Collector: simplify building Clojure Collector, thanks @butlermh (#223, #225)
    • Clojure Collector: fix Tomcat log bug of missing cs(Referer) (#220)
  • v0.8.1 Changes

    April 12, 2013
    • Hadoop ETL: bumped to 0.2.0
    • Hadoop ETL: break referer_url into constituent parts (part of #175)
    • Hadoop ETL: remove raw referrer_url (as no space in Redshift table defn) (part of #175)
    • Hadoop ETL: added referer parsing (#176)
    • Redshift: table-def script bumped to 0.1.0
    • Redshift: migration script added for 0.0.1 to 0.1.0
    • Redshift: add/update referer fields in Redshift table definition (#204)
    • Redshift: fix bug where mkt_source and mkt_medium are getting swapped around (#215)
    • Common: replaced embedded architecture images with CloudFront-hosted images
    • Common: completed rename of 3-etl to 3-enrich (#99)
    • Common: "SnowPlow" -> "Snowplow" in 1st and 2nd level READMEs
  • v0.8.0 Changes

    April 03, 2013
    • Hadoop ETL: added. Version 0.1.0 (#177)
    • Hadoop ETL: truncate 6 "high risk" fields for Redshift (raw useragent, page title etc) (#192)
    • Hadoop ETL: ev_value now extracted as a float (#201)
    • EmrEtlRunner: bumped to 0.1.0
    • EmrEtlRunner: updated to work with new config.yml fields (part of #178)
    • EmrEtlRunner: added support for Hadoop ETL (part of #178)
    • EmrEtlRunner: added run ID and human-friendly job name (#100)
    • EmrEtlRunner: added run IDs to output folders (Hadoop ETL only) (#79)
    • EmrEtlRunner: changed .rvmrc to .ruby-version, thanks @richo (part of #190)
    • StorageLoader: changed .rvmrc to .ruby-version, thanks @richo (part of #190)
    • StorageLoader: added final missing /Gemfile to BUNDLE_GEMFILE in Bash script, thanks @frutik (#206)
    • Common: started rename of 3-etl to 3-enrich (part of #99)
  • v0.7.6 Changes

    March 03, 2013
    • HiveQL: redshift-etl.q added. Version 0.0.1 (#174)
    • HiveQL: hive-rolling-etl.q renamed to hive-etl.q and bumped to 0.5.7
    • HiveQL: non-hive-rolling-etl.q renamed to mysql-infobright-etl.q and bumped to 0.0.8 (part of #172)
    • EmrEtlRunner: bumped to 0.0.9
    • EmrEtlRunner: renamed :snowplow: variable names and added new Redshift one in config.yml (part of #172)
    • EmrEtlRunner: updated to support Redshift as a storage format (#173)
    • EmrEtlRunner: added missing /Gemfile to BUNDLE_GEMFILE in Bash script
    • StorageLoader: bumped to 0.0.5
    • StorageLoader: added Redshift-specific fields to config.yml (part of #159)
    • StorageLoader: added Redshift load support into StorageLoader (part of #159)
    • StorageLoader: added missing /Gemfile to BUNDLE_GEMFILE in Bash scripts
    • Redshift: table-def.sql script added. Version 0.0.1 (#158)
    • Infobright: bumped setup_ and verify_infobright.sql to 0.0.8
    • Infobright: widened useragent field (#184)
    • Infobright: added migration script 0.0.7->0.0.8
    • Serde: fixed and enabled broken tests (#14). Version unchanged