Tullio Facchinetti
17 results tagged statistics
Principal Component Analysis http://www.oranlooney.com/post/ml-from-scratch-part-6-pca/
Wed 09 Oct 2019 07:46:37 PM CEST

PCA is a linear dimensionality reduction technique. Many non-linear dimensionality reduction techniques exist, but linear methods are more mature, if more limited.
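
As a rough illustration of the linear technique described above, the sketch below finds the first principal component of a small dataset: center the data, build the sample covariance matrix, and extract its dominant eigenvector. It uses power iteration for brevity, which is a simplification chosen here and not necessarily the method of the linked article. The function name `first_principal_component` and the sample points are hypothetical, and the data is assumed to have non-zero variance.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (unit direction, explained variance) of the first principal component."""
    n = len(rows)
    d = len(rows[0])
    # Center each column on its mean.
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]
    # Power iteration: repeatedly apply the covariance matrix and renormalize.
    v = [1.0] * d
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Variance along v is the Rayleigh quotient v^T C v (v has unit length).
    variance = sum(
        v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)
    )
    return v, variance


# Hypothetical sample: points lying exactly on the line y = 2x.
points = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
direction, variance = first_principal_component(points)
print(direction, variance)
```

Because the sample points are perfectly collinear, the recovered direction is parallel to (1, 2) and carries all of the variance. Real data would spread variance over further components, which this sketch does not compute.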

algorithm analytics article data_science methodology statistics
scc | Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations https://github.com/boyter/scc/
Tue 01 Oct 2019 09:12:31 AM CEST

Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go.

coding_lang:go opensource programming software statistics
Processing 40 TB of code from ~10 million projects with a dedicated server and Go for $100 | Ben E. C. Boyter https://boyter.org/posts/an-informal-survey-of-10-million-github-bitbucket-gitlab-projects/
Tue 01 Oct 2019 09:08:49 AM CEST
article language programming statistics
Goodhart’s Law: Are Academic Metrics Being Gamed? https://thegradient.pub/over-optimization-of-academic-publishing-metrics/
Wed 25 Sep 2019 08:21:48 PM CEST

Publishing a paper in academia is challenging, stimulating, and a bit baffling. Challenging because the research might fail. Stimulating because research may start assuming one outcome and finish with a totally different one. Baffling because after the paper is written and ready, I have to find it a home for

article research statistics
AutoOut | Automated Outlier Detection and Treatment Tool https://github.com/MateLabs/AutoOut
Sun 01 Sep 2019 09:55:11 PM CEST

AutoOut is an automated outlier detection and treatment tool that allows you to get better models with even better accuracy without writing a single line of code. With its simple, easy-to-use interface you can detect and treat outliers in your dataset, which can help improve your final model.

coding_lang:python machine_learning opensource software source_code statistics tools
git-stats - Git add-on to get statistics of a repository https://github.com/IonicaBizau/git-stats
Sun 25 Aug 2019 04:50:01 PM CEST

"Local git statistics including GitHub-like contributions calendars."

#cli-app git homepage opensource software statistics versioning
Logistic Regression from Bayes' Theorem https://www.countbayesie.com/blog/2019/6/12/logistic-regression-from-bayes-theorem
Thu 13 Jun 2019 05:17:24 PM CEST

In this post we’ll explore how we can derive logistic regression from Bayes’ Theorem. Starting with Bayes’ Theorem we’ll work our way to computing the log odds of our problem and then arrive at the inverse logit function. After reading this post you’ll have a much stronger intuition for how logistic
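
The chain described above (Bayes' Theorem, then log odds, then the inverse logit) can be checked numerically. The sketch below is only an illustration under made-up numbers: the prior and the two likelihoods are hypothetical, and the function names are not taken from the linked post. It shows that applying the inverse logit to the log odds gives the same posterior as Bayes' Theorem applied directly.

```python
import math


def posterior(prior, like_h, like_not_h):
    """P(H | D) computed directly from Bayes' Theorem."""
    return like_h * prior / (like_h * prior + like_not_h * (1.0 - prior))


def log_odds(prior, like_h, like_not_h):
    """Posterior log odds = log likelihood ratio + prior log odds."""
    return math.log(like_h / like_not_h) + math.log(prior / (1.0 - prior))


def inverse_logit(z):
    """Map log odds back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical values: P(H) = 0.3, P(D | H) = 0.8, P(D | not H) = 0.1.
direct = posterior(0.3, 0.8, 0.1)
via_logit = inverse_logit(log_odds(0.3, 0.8, 0.1))
print(direct, via_logit)
```

Logistic regression then amounts to modeling the log odds as a linear function of the inputs, so the inverse logit turns that linear score into a probability.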

algorithm article bayesian machine_learning statistics techniques
Significant Pattern Mining for Time Series - Christian Bock https://christian.bock.ml/posts/significant_shapelets/
Thu 13 Jun 2019 08:58:10 AM CEST

In the midst of the deep learning hype, p-values might not be the hottest topic in data science. However, association mapping remains a fundamental tool to justify and underpin scientific conclusions. Inspired by an approach for time series classification based on predictive subsequences (i.e., shapelets [1]), we developed S3M, a method that identifies short time series subsequences that are statistically associated with a class or phenotype while tackling the multiple hypothesis problem.

article machine_learning research science statistics time_series
BATgrowth - Monitoring Brave Browser adoption https://batgrowth.com/
Sat 03 Nov 2018 03:55:53 PM CET

When you first start reading about Brave, you learn that it is a new reward system for publishers and a new advertising model.

You may have wondered how many publishers there are, and who they are.

batgrowth.com scrapes the web to list websites that are BAT publishers.

browser cryptocurrency list statistics webservice
Using regression trees for forecasting double-seasonal time series with trend in R - Peter Laurinec https://petolau.github.io/Regression-trees-for-forecasting-time-series-in-R/
Fri 02 Nov 2018 03:10:10 PM CET

You will learn in this post how to:

  • decompose double-seasonal time series
  • detrend time series
  • model and forecast double-seasonal time series with trend
  • use two types of simple regression trees
  • set important hyperparameters related to regression trees
article blog forecasting machine_learning R statistics time_series
Statistical forecasting: notes on regression and time series analysis http://people.duke.edu/~rnau/411home.htm
Tue 16 Oct 2018 06:59:54 PM CEST

This web site contains notes and materials for an advanced elective course on statistical forecasting that is taught at the Fuqua School of Business, Duke University. It covers linear regression and time series forecasting models as well as general principles of thoughtful data analysis.

The time series material is illustrated with output produced by Statgraphics, a statistical software package that is highly interactive and has good features for testing and comparing models, including a parallel-model forecasting procedure that I designed many years ago.

The material on multivariate data analysis and linear regression is illustrated with output produced by RegressIt, a free Excel add-in which I also designed. However, these notes are platform-independent. Any statistical software package ought to provide the analytical capabilities needed for the various topics covered here.

5_stars data_science forecasting research statistics time_series
How to Use Correlation to Understand the Relationship Between Variables https://machinelearningmastery.com/how-to-use-correlation-to-understand-the-relationship-between-variables/
Sun 14 Oct 2018 10:51:42 PM CEST

There may be complex and unknown relationships between the variables in your dataset.

It is important to discover and quantify the degree to which variables in your dataset are dependent upon each other. This knowledge can help you better prepare your data to meet the expectations of machine learning algorithms, such as linear regression, whose performance will degrade with the presence of these interdependencies.

In this tutorial, you will discover that correlation is the statistical summary of the relationship between variables and how to calculate it for different types of variables and relationships.

After completing this tutorial, you will know:

  • How to calculate a covariance matrix to summarize the linear relationship between two or more variables.
  • How to calculate Pearson’s correlation coefficient to summarize the linear relationship between two variables.
  • How to calculate Spearman’s correlation coefficient to summarize the monotonic relationship between two variables.
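
The three quantities listed above can be sketched with the standard library alone. This is a minimal illustration, not the tutorial's own code: the helper names and the sample data are hypothetical, and Spearman's coefficient is computed here as Pearson's coefficient on average ranks.

```python
import math


def covariance(xs, ys):
    """Sample covariance of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (n - 1)


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical data: y = x**3 is monotonic but not linear in x.
xs = [1, 2, 3, 4, 5]
ys = [1, 8, 27, 64, 125]
r_pearson = pearson(xs, ys)
r_spearman = spearman(xs, ys)
print(covariance(xs, ys), r_pearson, r_spearman)
```

On this sample the relationship is perfectly monotonic, so Spearman's coefficient reaches 1 while Pearson's stays below 1, which is the distinction the last two bullets draw.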
dataset statistics tutorial
ROC curves calculator https://kennis-research.shinyapps.io/ROC-Curves/
Sat 07 Jul 2018 12:43:07 PM CEST

A receiver operating characteristic (ROC) is a graph that illustrates the performance of a binary classifier as its discrimination threshold (cutoff) is changed.

The curve is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various cutoff settings. The true positive rate is also known as sensitivity; the false positive rate is also known as fall-out and is calculated as (1 - specificity).

The ROC curve is thus a plot of the true positive rate (TPR) versus the false positive rate (FPR). The ROC curve can be generated by plotting the cumulative distribution function (area under the probability distribution from -∞ to the discrimination threshold) of the correct detection probability on the y-axis versus the cumulative distribution function of the false-alarm probability on the x-axis.
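
The construction described above can be sketched as follows: sweep the cutoff over the observed scores, and at each cutoff record the false positive rate and the true positive rate. This is an illustrative sketch, not the calculator's implementation. The function names, scores, and labels are hypothetical, and the data is assumed to contain at least one positive and one negative example.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, from the strictest cutoff to the loosest."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # cutoff above every score: nothing is flagged
    for cutoff in sorted(set(scores), reverse=True):
        # An example is predicted positive when its score reaches the cutoff.
        tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def area_under_curve(points):
    """Trapezoidal area under the piecewise-linear ROC curve."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


# Hypothetical classifier scores and true labels (1 = positive, 0 = negative).
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0, 0]
curve = roc_points(scores, labels)
auc = area_under_curve(curve)
print(curve, auc)
```

The curve always starts at (0, 0) and ends at (1, 1). A classifier that ranks every positive above every negative would give an area of 1.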

data_science math science statistics web
Basketball - About Box Plus/Minus (BPM) http://www.basketball-reference.com/about/bpm.html
Tue 13 Oct 2015 07:59:11 AM CEST

Box Plus/Minus (BPM) is a box score-based metric for evaluating basketball players' quality and contribution to the team. It is the latest version of a stat previously called Advanced Statistical Plus/Minus; it is NOT a version of Adjusted Plus/Minus, which is a play-by-play regression metric.

article basketball sport statistics
NBA Real Plus-Minus http://espn.go.com/nba/statistics/rpm/_/sort/RPM
Tue 13 Oct 2015 07:58:35 AM CEST

Glossary

  • GP: Games Played
  • MPG: Minutes Per Game
  • ORPM: Player's estimated on-court impact on team offensive performance, measured in points scored per 100 offensive possessions
  • DRPM: Player's estimated on-court impact on team defensive performance, measured in points allowed per 100 defensive possessions
  • RPM: Player's estimated on-court impact on team performance, measured in net point differential per 100 offensive and defensive possessions. RPM takes into account teammates, opponents and additional factors
  • WAR: The estimated number of team wins attributable to each player, based on RPM
article basketball sport statistics
Journal Metrics: Research analytics redefined | Home http://www.journalmetrics.com/
Thu 24 Jul 2014 09:03:36 PM CEST

Elsevier introduces IPP, SNIP & SJR: A new perspective in journal metrics for researchers and publishers

paper research science statistics work
OpenHub https://www.openhub.net/
Fri 21 Jul 2006 04:49:01 PM CEST

Discover, Track and Compare Open Source.

statistics web