How-To Guide

Getting Started

Welcome to the Data Cleansing Application! This guide will help you understand how to use the application to clean your data files.

Step 1: Upload Your File

You can upload your data file in two ways:

  • Drag and Drop: Simply drag your file from your computer and drop it onto the upload area on the main page.
  • Click to Browse: Click the "Select File" button and choose your file from the file browser.

Supported file formats: CSV, JSON, Excel (.xlsx, .xls), TSV, TXT, DAT

Maximum file size: 10 MB

Step 2: Review Default Transformations

Once your file is uploaded, the application will automatically apply the following default transformations to all data:

  • Trim whitespace: Removes leading and trailing spaces from all values
  • Replace multiple spaces: Converts multiple consecutive spaces into a single space
  • Replace tabs: Converts tab characters into single spaces

These transformations are applied automatically to ensure basic data cleanliness.

Step 3: Select Optional Transformations (Structured Data Only)

If your file contains structured data (CSV, JSON, Excel, TSV), you'll see additional transformation options for each field:

  • Uppercase: Converts all text in the field to uppercase letters
  • Lowercase: Converts all text in the field to lowercase letters
  • Remove Duplicates: Removes duplicate rows based on the values in this field

You can select multiple transformations for each field by checking the corresponding boxes.

Note: For unstructured data files (plain TXT, DAT), only the default transformations will be applied.

Step 4: Process and Download

Once you've selected your desired transformations:

  1. Click the "Process & Download" button
  2. The application will process your file in your browser (no data is sent to any server)
  3. Your cleansed file will automatically download in the same format as the original
  4. The downloaded file will have "_cleansed" added to the filename

Important Notes

  • Privacy: All processing happens in your browser. Your files are never uploaded to any server.
  • File Size: Files larger than 10 MB cannot be processed due to browser memory limitations.
  • Browser Compatibility: This application works best on modern browsers (Chrome, Firefox, Safari, Edge).
  • Data Integrity: Always verify your cleansed data before using it in production systems.

Tips for Best Results

  • Review your data structure before uploading to understand which transformations you need
  • For CSV files, ensure they have proper headers in the first row
  • Test with a small sample file first to verify the transformations work as expected
  • Keep a backup of your original file before processing