Additional information about window duration, period and sessions
The Window processor allows you to partition streaming data into several types of time windows.: fixed time windows, sliding windows and session windows.
The Window processor starts a new window every period of time.
This window stores in memory records during a certain time (duration) then sends micro-batches of data to the output.
Fixed Time Windows
-
all windows have the same consistent duration and never overlap
-
only one window is stored in memory at a given time
-
one piece of data is captured in one window
These windows are useful if you want to capture all data all the time.
data:image/s3,"s3://crabby-images/e2a61/e2a619c989915a58a09d79a4863468e02b712a99" alt="Graphical representation of fixed-time windows."
Here, all data with timestamp values from 00:00:00 to 01:00:00 belong to Window 1, data with timestamp values from 01:00:00 to 02:00:00 belong to Window 2, etc.
Sliding windows
-
multiple windows can overlap
-
several windows are stored in memory at the same time
-
elements in a data set can be captured in more than one window
These windows are useful for sampling purposes and for taking running averages of data.
data:image/s3,"s3://crabby-images/08bec/08bec76b0ca726bec4842bfc9bb963796c3e8d9f" alt="Graphical representation of sliding windows with overlapping windows."
Here all data with timestamp values from 00:00:00 to 01:00:00 belong to Window 1, data with timestamp values from 00:30:00 to 01:30:00 belong to Window 2, etc. In this example, you can compute a running average of the past hour's worth of data, updated every 30 minutes.
data:image/s3,"s3://crabby-images/7661b/7661bcc67c55324a9bd3901d8dd77067464d84b1" alt="Graphical representation of sliding windows with windows updated every 30 minutes."
Here all data with timestamp values from 00:00:00 to 01:00:00 belong to Window 1, data with timestamp values from 01:30:00 to 02:30:00 belong to Window 2 and so on. Windows do not overlap and pieces of data are stored in different windows.
Session windows
-
high concentrations of data are grouped into separate windows
-
idle sections of the data stream are filtered out
-
data can be captured in disjoint windows of different sizes
These windows are useful for data that is irregularly distributed with respect to time. For example, a data stream representing user mouse activity may have long periods of idle time interspersed with high concentrations of clicks.
data:image/s3,"s3://crabby-images/931e5/931e591c1784b9ed15b080435ce5f2cfd70b78ca" alt="Graphical representation of session windows."
Here the data that represents some activity is stored within a window that closes when it does not receive data during at least five minutes (duration gap). Windows do not overlap and do not have a fixed start and end time.