Customer-facing Data Engineer with 5+ years building scalable enterprise architectures, cloud data pipelines, and AI-powered analytics at Dell Technologies.
Featured Projects
End-to-end AWS pipeline extracting real-time news & tweets with time-series forecasting at 92% accuracy. Built with AWS Lambda, Step Functions, Redshift, Airflow & Streamlit.
View project →ML algorithm with 97% accuracy benchmarking 3 supervised ML models (Linear Regression, Random Forest, Neural Net) against 3 AutoML frameworks including H2O.ai & AutoSklearn.
View on GitHub →ETL pipeline loading 60M+ rows across Postgres, Snowflake, MySQL & Oracle in 70 minutes using Talend. Dashboards in Tableau, Power BI, Qlik Sense & Looker.
View on GitHub →Scraped worldometer.info for all countries & US states, stored in SQL Server, built interactive Plotly dashboards & Tableau Public visualizations. ⭐ 3 GitHub Stars.
View on GitHub →About Me
I'm a Data Engineering Advisor at Dell Technologies where I architect enterprise-scale data pipelines, lead technical proof-of-concepts for Fortune 500 customers, and present executive-level readouts to senior leadership.
Over 5+ years I've deployed automated solutions that impacted $63M in orders, built ML models hitting 86.32% accuracy in production failure forecasting, established 99.9% SLA uptime for executive dashboards, and drove a 65% increase in adoption for the Order Life Cycle process mining initiative at Dell.
Prior to my current role, I worked as a BI Engineer Co-op at Dell (Franklin, MA) where I optimized SQL pipelines saving $40K YoY, and as a Research Assistant & TA at Northeastern University where I mentored 150+ graduate students in Spark, SQL, Python & Power BI.
I hold a Master of Science in Information Systems from Northeastern University (GPA 3.84) and won the Global COVID-19 Hackathon in 2020. I'm fluent at bridging the gap between complex data systems and clear business value for stakeholders at every level.
Career
5+ years designing data solutions across enterprise environments, university research, and strategic business development.
Portfolio
Real-world data engineering, ML, and BI projects — including all public repositories from github.com/jayshilj.
End-to-end AWS app that extracts real-time news, tweets & trend data and forecasts future market trends with 92% accuracy. Uses Airflow for orchestration and Streamlit for the dashboard.
Interest rate prediction algorithm with 97% accuracy. Evaluated 3 supervised ML models (Linear Regression, Random Forest, Neural Networks) and 3 AutoML frameworks (AutoSklearn, H2O.ai, TPOT). Applied MICE imputation and LassoCV feature selection on a dataset of 2.2M+ loan records.
Orchestrated an ETL workflow in Talend loading tables across 4 databases (60M+ rows) in a scheduled manner within 70 minutes. Built BI dashboards in Tableau, Power BI (DAX), Qlik Sense, and Looker. Modelled with ER/Studio and profiled with Alteryx.
Scraped worldometer.info for all countries & US states using BeautifulSoup. Stored cleaned data in SQL Server with a Windows Task Scheduler. Built interactive Plotly dashboards and published Tableau Public visualizations tracking daily COVID-19 trends.
Data warehousing and BI solution for a retail store dataset. Used Alteryx for data profiling and transformation and Talend for ETL workflow. Power BI reports with custom DAX measures for executive dashboards.
In-depth Power BI dashboard with drill-through and historical trend analysis covering school and staff data from 2009 to 2018. Features executive-level KPIs, cross-filtering, and custom DAX measures.
Contact
Open to full-time Data Engineering, Solutions Architecture, and ML Engineering roles. Also available for consulting and technical advisory engagements.
I'm Jayshil Jain — a Data Engineer and Solutions Architect with 5+ years at Dell Technologies. Whether you have a full-time role, a consulting project, or just want to connect, I'd love to hear from you. I typically respond within 24 hours.
Fill out the form below and I'll get back to you within 24 hours.