How it Works
Datastream is a raw data pipeline that delivers real-time, user-level data from visitor interactions on a page, streamed to your Amazon S3 or Google Cloud Storage bucket.
Our feed contains over 50 fields of data. These can be categorized into four broad groups:
Engagement: Chartbeat’s best-in-class engagement metrics, such as engaged time, time on page, scroll depth, and page and browser geometry.
Data about the page: Data points related to the identity of the page, such as the path, title, section and author, content type, platform, and sponsor data associated with each page view.
Data about the user: For user-level analysis, a unique ID, the browser’s user agent string, frequency, and recency.
Timestamp: The time the visitor arrived on the page, the time they left the page, and the user's time zone.
Platforms whose data Datastream supports include:
Web
Google AMP
Facebook Instant Articles
Apple News
Your own native app
Chartbeat’s Datastream Reporting supports exporting data to the following data storage platforms:
Amazon Web Services
Google Cloud Storage
File Format: CSV, one row per Chartbeat-logged, session-expired page view
Compression Type: GZIP
Delimiters: pipe-separated
Character Encoding: UTF-8
Example File Naming Convention: rawdata/YYYY/MM/DD/h/[00|30]/[epoch timestamp].[file hash].csv.gz
Data Batch Interval: by minute
Delivery Frequency: by minute
Delivery Destination: Amazon S3 or GCS bucket with shared read/write permissions
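As a rough illustration of the delivery specification above, the following Python sketch splits an object key that follows the example naming convention and decodes one delivered file (GZIP, UTF-8, pipe-delimited). It rests on several assumptions that are not confirmed by this page: that the hour folder is one or two digits, that the file hash contains no dots, that fields use default CSV quoting, and that column names are supplied by the caller rather than read from a header row. The function names parse_key and read_rows are hypothetical helpers, not part of any Chartbeat library.

```python
import csv
import gzip
import io
import re

# Example naming convention from this page:
# rawdata/YYYY/MM/DD/h/[00|30]/[epoch timestamp].[file hash].csv.gz
# Assumption: the hour is 1-2 digits and the hash contains no dots.
KEY_PATTERN = re.compile(
    r"^rawdata/(?P<year>\d{4})/(?P<month>\d{2})/(?P<day>\d{2})/"
    r"(?P<hour>\d{1,2})/(?P<half>00|30)/"
    r"(?P<epoch>\d+)\.(?P<file_hash>[^./]+)\.csv\.gz$"
)


def parse_key(key):
    """Split an object key into its path components, or return None."""
    match = KEY_PATTERN.match(key)
    if match is None:
        return None
    parts = match.groupdict()
    # The unit of the epoch value is not stated here, so keep it as an integer.
    parts["epoch"] = int(parts["epoch"])
    return parts


def read_rows(raw_bytes, fieldnames=None):
    """Yield rows from one GZIP-compressed, UTF-8, pipe-delimited file.

    Rows are lists of strings, or dicts when the caller supplies
    fieldnames (hypothetical; take real names from the field reference).
    """
    text = gzip.decompress(raw_bytes).decode("utf-8")
    reader = csv.reader(io.StringIO(text, newline=""), delimiter="|")
    for row in reader:
        yield dict(zip(fieldnames, row)) if fieldnames else row
```

Downloading the object bytes from the S3 or GCS bucket is left to whichever storage client you already use; read_rows only needs the raw file contents.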
Click the link below to download a sample data CSV file or preview a row of pageview data.
Website owners can populate their Datastream feed with custom ID values by adding a few extra lines of JavaScript to our tracking snippet for standard websites. This custom metadata can be used to join a user’s Chartbeat engagement data to other data sources, or to enrich engagement data by specifying information about the current viewing session.
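To make the join use case concrete, here is a minimal Python sketch that matches Datastream rows to records from another data source on a shared custom ID. The field name custom_id, the subscription_tier attribute, and the function join_on_custom_id are all hypothetical and chosen only for illustration; the actual column that carries your custom ID depends on how the feed is configured.

```python
def join_on_custom_id(datastream_rows, external_records, id_field="custom_id"):
    """Inner-join two lists of dicts on a shared ID field.

    id_field is a hypothetical column name; substitute the field
    that actually carries your custom ID in the feed.
    """
    # Index the external records once so each lookup is O(1).
    by_id = {record[id_field]: record for record in external_records}
    joined = []
    for row in datastream_rows:
        match = by_id.get(row.get(id_field))
        if match is not None:
            # On a name collision, the Datastream value wins.
            joined.append({**match, **row})
    return joined


# Hypothetical sample data, not real Datastream output.
views = [
    {"custom_id": "u1", "path": "/news/a"},
    {"custom_id": "u9", "path": "/news/b"},
]
subscribers = [{"custom_id": "u1", "subscription_tier": "premium"}]
enriched = join_on_custom_id(views, subscribers)
```

Rows without a matching ID are dropped in this sketch; a left join that keeps unmatched page views may suit engagement analysis better.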