A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.
-
Updated
Jul 7, 2025 - Python
A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.
Microarchitectural exploitation and other hardware attacks.
State-of-the-art neural cardinality estimators for join queries
TIP2022 Adaptive Boosting (AdaBoost) for Domain Adaptation ? 🤷♀️ Why not ! 🙆♀️
Reinforced Data Sampling
Implémentation d'un modèle de scoring (OpenClassrooms | Data Scientist | Projet 7)
This project aims to analyze the citation network of arXiv papers. We use Python to clean the data and create a Neo4j network to visualize and analyze the citation relationships between arXiv papers.
This file covers all concepts of R language..
Generating realistic test data or simulating load with authentic, dynamic data using the Gatling framework and JavaFaker
Adaptive data sampling and transmission in a wireless sensor node as a function of energy reserves
BiBiER: A Bilingual Bimodal Emotion Recognition Method with Improved Generalization through Data Augmentation
Process of data preparaton in R.
Here is Task 5: Credit card fraud detection using machine learning, for my data science internship with Codsoft
Professional Python tool for intelligently selecting and copying media files with advanced filtering, performance optimization, and resume capabilities. Perfect for dataset creation, content curation, and large-scale media management.
A Python package for flexible subset selection for data visualization.
Code and Data for paper: Variation across Scales: Measurement Fidelity under Twitter Data Sampling (ICWSM '20)
A Semi-Supervised Multimodal Framework for Multitask Emotion and Personality Traits Recognition Using Cross-Domain Learning
This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.
This project demonstrates data sampling techniques in Snowflake. It covers loading datasets from S3, performing RANDOM and SYSTEM sampling methods to extract subsets, validating sampled data, and optimizing analysis on datasets.
Add a description, image, and links to the data-sampling topic page so that developers can more easily learn about it.
To associate your repository with the data-sampling topic, visit your repo's landing page and select "manage topics."