Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by LuisFalva
A curated list of projects in awesome lists by LuisFalva .
https://github.com/luisfalva/ophelia
Ophelian On Mars! More than a simple framework.
dask dataframe ophelia ophelia-spark rdd spark spark-ml spark-mllib spark-streaming
Last synced: 17 Dec 2024
https://github.com/luisfalva/sparksmote
In practice, in the high-dimensional setting only k-NN classifiers based on the Euclidean distance seem to benefit substantially from the use of SMOTE; the benefit is larger if more neighbors are used. SMOTE for k-NN without variable selection should not be used, because it strongly biases the classification towards the minority class.
Last synced: 24 Dec 2024
https://github.com/luisfalva/archipelagochallenge
This bitso challenge project intends to be the solution for this Archipelago issue.
Last synced: 24 Dec 2024
https://github.com/luisfalva/timeseriessarima
This repo contains all my research based on a AirPassenger data set forecasting the amount of passengers according to seasonal oscillations, my first approach is based on a Box & Jenkins method for ARIMA modelation with seasonality component.
Last synced: 24 Dec 2024
https://github.com/luisfalva/titanic_kernel_svm_mlops
A Kernel SVM model into production
Last synced: 24 Dec 2024
https://github.com/luisfalva/entropyginiprobability
EntropyGiniProbability repo is one of my favourite class notes that I took on college days. It aim to explain in a mathematical way the difference between Gini and Entropy measurements and when to use one of them.
Last synced: 24 Dec 2024
https://github.com/luisfalva/demodask
Dask has utilities and documentation on how to deploy internally, in the cloud, or on HPC supercomputers. Supports encryption and authentication using TLS / SSL certificates. It's tough and can handle failure of worker nodes gracefully and it's springy so you can take advantage of new nodes added on the fly. Dask includes several user APIs that are used and refined by thousands of researchers around the world working in different domains.
Last synced: 24 Dec 2024
https://github.com/luisfalva/wine_quality_dataset_analysis
This repo contains all my works based on a Wine Quality dataset, my first approach is based on a multiple linear model with Jupyter's R kernel. This is just the beginning of the model, I am trying to create a multinomial regression.
Last synced: 24 Dec 2024
https://github.com/luisfalva/pessoatherapy
Pessoa-Therapy is a college project, this aim to be a tool for psychotherapists and psychiatrists to support which type of medication is appropriate according to the multifactorial behaviour from a patient.
Last synced: 24 Dec 2024