Tullio Facchinetti
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • ATOM Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Display

Filter untagged links
10 results tagged dataset  ✕
OpenRefine https://openrefine.org/
Mon 23 Mar 2020 09:15:31 PM CET
QRCode

OpenRefine (previously Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.

OpenRefine always keeps your data private on your own computer until YOU want to share or collaborate. Your private data never leaves your computer unless you want it to. (It works by running a small server on your computer and you use your web browser to interact with it)

data_science dataset homepage opensource software
The Fortran 77 codes for the open-loop and the closed-loop simulations for the Tennessee Eastman process (TEP). https://github.com/camaramm/tennessee-eastman-profBraatz
Wed 11 Mar 2020 10:55:57 PM CET
QRCode

The Fortran 77 codes for the open-loop and the closed-loop simulations for the Tennessee Eastman process (TEP) as well as the training and testing data files used for evaluating the data-driven methods (PCA, PLS, FDA, and CVA).

automation coding_lang:fortran dataset research simulation software
Million Song Dataset http://millionsongdataset.com/
Tue 29 Oct 2019 10:20:17 PM CET
QRCode
dataset homepage
The Dataverse Project https://dataverse.org/#
Sun 06 Oct 2019 11:53:43 PM CEST
QRCode

The Dataverse Project - Dataverse.org

dataset research science
Introducing the CodeSearchNet challenge https://github.blog/2019-09-26-introducing-the-codesearchnet-challenge/
Sat 28 Sep 2019 08:38:16 PM CEST
QRCode

GitHub announces the CodeSearchNet Challenge and releasing a large dataset for natural language processing and machine learning.

article dataset machine_learning
frictionlessdata | A Python library for working with Data Packages https://github.com/frictionlessdata/datapackage-py
Fri 06 Sep 2019 04:07:40 PM CEST
QRCode
coding_lang:python csv data dataset file_format json library machine_learning python source_code
Estimating the success of re-identifications in incomplete datasets using generative models | Nature Communications https://www.nature.com/articles/s41467-019-10933-3
Mon 29 Jul 2019 04:58:54 AM CEST
QRCode

Anonymization has been the main means of addressing privacy concerns in sharing medical and socio-demographic data. Here, the authors estimate the likelihood that a specific person can be re-identified in heavily incomplete datasets, casting doubt on the adequacy of current anonymization practices.

dataset machine_learning paper
Carburanti – Archivio storico dei prezzi praticati e dell'anagrafica degli impianti https://www.mise.gov.it/index.php/it/open-data/elenco-dataset/2036944-carburanti-archivio-prezzi
Fri 19 Apr 2019 11:18:00 AM CEST
QRCode

Archivio dei dataset pubblicati da marzo 2015 raggruppati per trimestre fornito dal MISE.

Per la ricerca e la consultazione in tempo reale dei prezzi praticati e la ricerca degli impianti è possibile consultare il sito dell’Osservatorio prezzi carburanti.

I dati sono in formato .csv. A causa delle notevoli dimensioni i file sono comunque compressi come tar.gz.

dataset locale:it
How to Use Correlation to Understand the Relationship Between Variables https://machinelearningmastery.com/how-to-use-correlation-to-understand-the-relationship-between-variables/
Sun 14 Oct 2018 10:51:42 PM CEST
QRCode

There may be complex and unknown relationships between the variables in your dataset.

It is important to discover and quantify the degree to which variables in your dataset are dependent upon each other. This knowledge can help you better prepare your data to meet the expectations of machine learning algorithms, such as linear regression, whose performance will degrade with the presence of these interdependencies.

In this tutorial, you will discover that correlation is the statistical summary of the relationship between variables and how to calculate it for different types variables and relationships.

After completing this tutorial, you will know:

  • How to calculate a covariance matrix to summarize the linear relationship between two or more variables.
  • How to calculate the Pearson’s correlation coefficient to summarize the linear relationship between two variables.
  • How to calculate the Spearman’s correlation coefficient to summarize the monotonic relationship between two variables.
dataset statistics tutorial
How to Model Human Activity From Smartphone Data https://machinelearningmastery.com/how-to-model-human-activity-from-smartphone-data/
Sat 06 Oct 2018 07:14:20 PM CEST
QRCode

Human activity recognition is the problem of classifying sequences of accelerometer data recorded by specialized harnesses or smart phones into known well-defined movements.

It is a challenging problem given the large number of observations produced each second, the temporal nature of the observations, and the lack of a clear way to relate accelerometer data to known movements.

Classical approaches to the problem involve hand crafting features from the time series data based on fixed-sized windows and training machine learning models, such as ensembles of decision trees. The difficulty is that this feature engineering requires deep expertise in the field.

Recently, deep learning methods such as recurrent neural networks and one-dimensional convolutional neural networks, or CNNs, have been shown to provide state-of-the-art results on challenging activity recognition tasks with little or no data feature engineering.

In this tutorial, you will discover the ‘Activity Recognition Using Smartphones‘ dataset for time series classification and how to load and explore the dataset in order to make it ready for predictive modeling.

article blog coding_lang:python dataset human_activity_recognition machine_learning sensors
3654 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn