Wasif Haque
Available for opportunities

Wasif Haque

Data Scientist | ML Engineer | Computer Vision Specialist

I build fraud-detection ML systems that are explainable to stakeholders and reliable in production.

From model design to deployment, I focus on measurable impact: faster pipelines, fewer false positives, and maintainable architecture.

4+
Years Experience
10+
ML Models Deployed
75%
Faster OCR Pipeline

Impact Highlights

Three examples of how I turn model work into measurable business outcomes.

Production Model Delivery

Problem: Fraud signals were fragmented across separate tools and hard to operationalize.

Approach: Built deployable models for handwriting verification, signature forgery, and field extraction with shared evaluation standards.

Outcome: Shipped 10+ production models and reduced handoff friction between research and engineering.

OCR Throughput Optimization

Problem: OCR latency constrained fraud-review throughput and customer responsiveness.

Approach: Reworked pipeline bottlenecks, optimized model ordering, and tuned queue processing based on production traces.

Outcome: Delivered a 75% processing-time reduction with faster downstream fraud decisions.

High-Stakes Client Recovery

Problem: Enterprise clients faced elevated false positives affecting operations and trust.

Approach: Led direct diagnostics with stakeholders, identified root-cause thresholds/features, and shipped targeted fixes quickly.

Outcome: Stabilized client performance and improved confidence in production fraud decisions.

About Me

What Drives Me

I'm a Data Scientist with 4+ years of experience turning research ideas into production systems. My focus is document fraud detection across computer vision, deep learning, and platform engineering.

I care about work that ships and stays useful: reducing processing time by 75%, improving model quality in the field, and building systems teams can maintain as requirements evolve.

Machine Learning

Deep Learning, CNNs, Siamese Networks, YOLO, Random Forest, PyTorch, TensorFlow

Computer Vision

OpenCV, Image Processing, OCR, Document Analysis, Feature Extraction

Software Engineering

Python, Java, System Architecture, Performance Optimization, Design Patterns

Fraud Detection

Handwriting Verification, Signature Detection, Check Analysis, Anomaly Detection

Professional Experience

Data Scientist I

ARGO Data - Fraud Detection & Analytics R&D Group Jan 2023 – Present · 2+ yrs

I own end-to-end ML delivery for check fraud detection, from research to production rollout.

Show details

Key Contributions:

  • Developed production models for handwriting verification (Siamese CNNs), signature forgery (Random Forest), and YOLO-based check field extraction
  • Led an architecture migration to a feature graph + plugin model, improving maintainability and feature delivery speed
  • Achieved 75% reduction in OCR processing time through strategic pipeline optimization, making our solution competitive in the market
  • Built a queue-optimization tool that explored thousands of configurations to reduce false positives while preserving fraud capture; the tool became a paid customer service offering
  • Resolved high-stakes client false-positive issues through direct analysis, stakeholder presentations, and rapid corrective releases
  • Mentored junior data scientists and interns, helping them translate research code into production-ready systems
Python PyTorch OpenCV Tesseract Azure OCR TrOCR PaddleOCR YOLO scikit-learn

Software Engineer

ARGO Data - Fraud Detection & Analytics R&D Group Jan 2021 – Dec 2022 · 2 yrs

Bridged R&D and engineering by integrating analytical improvements into the core Java fraud platform.

Show details

Key Contributions:

  • Replaced PMML with M2cgen to modernize model execution and deployment
  • Implemented a Bloom filter for duplicate-check detection, removing expensive database lookups
  • Improved anomaly clustering and serial-number validation through targeted model and threshold tuning
  • Prototyped handwriting-style verification that established the baseline for full check-alteration detection development
Java Spring Boot Python SQL

Graduate Teaching Assistant

The University of Texas at Dallas Aug 2017 – May 2020
Show details
  • Assisted in teaching and grading for graduate-level computer science courses
  • Held office hours to provide academic support to students
  • Courses: Advanced Algorithms Design & Analysis, Software Engineering

Interactive Demos

Experience machine learning concepts through hands-on visualization

Neural Network Visualizer

Build and visualize a neural network architecture. Watch data flow through layers in real-time.

4 neurons
Medium
Layers: 3
Total Neurons: 12
Connections: 48

Generative Art Engine

Create unique algorithmic art with mathematical patterns. Each generation is one-of-a-kind.

Medium
Elements: 0
Pattern: -
Seed: -

Fraud Threshold Simulator

Tune the decision threshold and see precision, recall, review load, and expected business cost update in real time.

Why it matters: threshold tuning is where model quality becomes real-world fraud capture vs. false-positive pain.

0.72
8%
12,000
$8
$220
Legitimate Fraud Threshold
Precision -
Recall -
False Positives / Day -
Expected Cost / Day -

Skills & Technologies

Machine Learning & AI

Deep Learning PyTorch TensorFlow Keras scikit-learn CNNs Siamese Networks YOLO Random Forest Model Training & Evaluation

Computer Vision

OpenCV Image Processing Template Matching OCR (Tesseract) Azure OCR PaddleOCR TrOCR Feature Extraction Document Analysis

Software Engineering

Python Java Spring Boot System Architecture Design Patterns Performance Optimization Plugin Architecture

Data Science

Data Analysis pandas NumPy Matplotlib Jupyter SQL Customer Analytics

Education

Master of Science in Computer Science

The University of Texas at Dallas

2020

Specialization: Intelligent Systems

Focus Areas: Deep Learning, Machine Learning, Software Engineering, and Data Analytics

Bachelor of Science in Computer Science

BRAC University

2017

Graduated with Highest Distinction

Strong foundation in competitive programming and algorithmic problem-solving

Let's Connect

Location

Dallas-Fort Worth Metroplex, Texas

Send Me a Message