How-To Guide
Getting Started
Welcome to the Data Cleansing Application! This guide will help you understand how to use the application to clean your data files.
Step 1: Upload Your File
You can upload your data file in two ways:
- Drag and Drop: Simply drag your file from your computer and drop it onto the upload area on the main page.
- Click to Browse: Click the "Select File" button and choose your file from the file browser.
Supported file formats: CSV, JSON, Excel (.xlsx, .xls), TSV, TXT, DAT
Maximum file size: 10 MB
Step 2: Review Default Transformations
Once your file is uploaded, the application will automatically apply the following default transformations to all data:
- Trim whitespace: Removes leading and trailing spaces from all values
- Replace multiple spaces: Converts multiple consecutive spaces into a single space
- Replace tabs: Converts tab characters into single spaces
These transformations are applied automatically to ensure basic data cleanliness.
Step 3: Select Optional Transformations (Structured Data Only)
If your file contains structured data (CSV, JSON, Excel, TSV), you'll see additional transformation options for each field:
- Uppercase: Converts all text in the field to uppercase letters
- Lowercase: Converts all text in the field to lowercase letters
- Remove Duplicates: Removes duplicate rows based on the values in this field
You can select multiple transformations for each field by checking the corresponding boxes.
Note: For unstructured data files (plain TXT, DAT), only the default transformations will be applied.
Step 4: Process and Download
Once you've selected your desired transformations:
- Click the "Process & Download" button
- The application will process your file in your browser (no data is sent to any server)
- Your cleansed file will automatically download in the same format as the original
- The downloaded file will have "_cleansed" added to the filename
Important Notes
- Privacy: All processing happens in your browser. Your files are never uploaded to any server.
- File Size: Files larger than 10 MB cannot be processed due to browser memory limitations.
- Browser Compatibility: This application works best on modern browsers (Chrome, Firefox, Safari, Edge).
- Data Integrity: Always verify your cleansed data before using it in production systems.
Tips for Best Results
- Review your data structure before uploading to understand which transformations you need
- For CSV files, ensure they have proper headers in the first row
- Test with a small sample file first to verify the transformations work as expected
- Keep a backup of your original file before processing