code-generation Prompts
SQL Query Optimizer and Rewriter
Analyze SQL queries for performance issues and generate optimized versions with proper indexing recommendations, query rewrites, and execution plan analysis.
Recommendation Engine Builder
Generate a complete recommendation system with collaborative filtering, content-based, or hybrid approaches including data preparation, model training, evaluation, and serving logic.
NLP Text Processing Pipeline
Generate a complete NLP text processing pipeline with tokenization, cleaning, feature extraction, and downstream task setup for text classification, entity extraction, or sentiment analysis.
Time Series Forecasting Pipeline
Generate a complete time series forecasting pipeline with data preparation, model selection, validation strategy, and forecast visualization for business planning and demand prediction.
Anomaly Detection System Setup
Generate a complete anomaly detection system with multiple detection algorithms, threshold tuning, alerting logic, and visualization for monitoring data streams or batch datasets.
Data Catalog and Documentation Builder
Generate a comprehensive data catalog with schema documentation, column descriptions, lineage tracking, and usage examples for datasets across your organization.
ML Model Training Pipeline Scaffold
Generate a complete machine learning training pipeline with data splitting, preprocessing, model training, hyperparameter tuning, and experiment logging ready for production deployment.
Jupyter Notebook Template Generator
Generate structured, well-documented Jupyter notebook templates with standard sections, helper utilities, and best practices for reproducible data science workflows.
Statistical Test Selector and Interpreter
Identifies the correct statistical test for your data and research question, then generates complete analysis code with proper assumptions checking, test execution, and plain-language interpretation.
Data Quality Checker and Profiler
Generate a comprehensive data quality profiling and validation system that detects anomalies, enforces schema constraints, and produces detailed quality reports for any dataset.
ETL Pipeline Designer
Generate a complete ETL (Extract, Transform, Load) pipeline with data extraction from multiple sources, transformation logic, error handling, and loading into target data stores.
A/B Test Statistical Analyzer
Generate a complete A/B test analysis pipeline with sample size calculation, statistical testing, confidence intervals, and decision-ready visualizations for experiment evaluation.