Machine Learning Data Preparation Snap Pack
Snaps to prepare data for machine learning operations.
Overview
ML Data Preparation Snap Pack is a part of SnapLogic Data Science (Machine Learning) Snaps. The Snaps in this Snap Pack are useful in preparing the data upon which machine learning operations are to be performed.
Use Snaps in this Snap Pack to:
- Categorical to Numeric: Convert categorical data to numeric and vice versa.
- Clean Missing Values: Handle missing values in a dataset.
- Date Time Extractor: Extract datetime components.
- Deduplicate: Identify and remove duplicate records from datasets.
- Feature Synthesis: Create features out of multiple datasets that share a one-to-one or one-to-many relationship with each other.
- Mask: Mask sensitive information in your dataset before exporting the dataset for analytics.
- Match: Identify matched records across datasets.
- Principal Component Analysis: Perform Principal Component Analysis (PCA) on an input document.
- Sample: Generate samples from a dataset.
- Scale: Scale/transform data.
- Shuffle: Randomly shuffle the order of documents in a dataset.
- Type Converter: Convert datatypes.