Rudimentary analysis of internet usage by country (AT, CH, DE, IL, GB, CZ) and age
Some notes and points based on EDA of the data. Some nice box-plots created by Plotly.
Somehow advanced usage of python's enums. Including a discussion on testing of those custom entities.
Motivation Consider the case where you implement some logic somewhere, and this logic should be used from within several different places. Think of a logic and two apps using it. The logic should be yielding logging messages that would be visible as part of the running of the different apps …
You will build a RESTful API exposing a trained prediction model.
Tutorial on how to start a cluster of dask instances on AWS (EC2). Using this cluster execute an expansive grid search.
When grouping by DataFrame the order does matter and may be surprising.
Trying to give an intuitive understanding what's the difference between a biased and unbiased estimators of variance of a sample.
A gotcha when aggregated time series data involving hourly based counts.
(Original notebooks can be found in this gist) Assume you have data set as follows: ID Date Value x x x where each row contains an ID, a date (given as pd.Datetime) and a value. The objective is to count how many rows occur in each day. import pandas …