Building Superset Dashboard and Pipeline using Apache Airflow and Google Cloud SQL

The blog is posted by WeCloudData’s Data Science Bootcamp student Ryan Kang. Like Amazon AWS, Google Cloud is a popular cloud used by data analytics companies. Google Cloud allows continuous automation of workflow and big data computation. In this blog, I will briefly introduce how I set up Google Cloud for workflow. Each Google Cloud account […]
Web Scraping – Fishing Ontario

The blog is posted by WeCloudData’s Data Science Bootcamp student Weichen Lu. Once, I was talking with my colleague about outdoor activities, and he told me that he is a fishing enthusiast. It didn’t bring up my attention at first since I am not a fishing guy. However, he proposed an idea to use Google […]
Visualizing New York City Taxi Data

[Student Project] Visualizing New York City Taxi Data This blog is created by WeCloudData’s Data Science Bootcamp alumni Yaoyu Cui. Please find the complete dashboard on https://goo.gl/gXGTEw Tableau has been one of the most popular visualization tools among the Data Science community. Besides its ability of data preprocessing and programming, it also provides powerful mapping […]
Credit Scoring Using Machine Learning

The credit score is a numeric expression measuring people’s creditworthiness. The banking usually utilizes it as a method to support the decision-making about credit applications. In this blog, I will talk about how to develop a standard scorecard with Python (Pandas, Sklearn), which is the most popular and simplest form for credit scoring with machine […]
Fraud Analytics: ML Tutorial on Dealing with an Imbalanced Dataset

This blog is posted by WeCloudData’s Immersive Bootcamp student Anthony Chen. Fraud analytics provide a certain challenge that people may glance over at first. The problem of the imbalanced dataset. How do we approach it? What angle should we start at? What kind of performance measures do we use? The goal of this article is […]
Building Digital Marketing Dashboard Using Python, Docker, Airflow in Google Cloud (Part-2)

This blog series is posted by WeCloudData’s Data Science Immersive Bootcamp student Bob Huang (Linkedin) Continuing from the first half of the digital marketing blog post, This is Part 2 that mainly focusing on the data analysis business insights of different social platforms Email: There are lots of information in our emails. We can write codes […]
Building Digital Marketing Dashboard Using Python, Docker, Airflow in Google Cloud (Part 1)

This blog series is posted by WeCloudData’s Data Science Immersive Bootcamp student Bob Huang (Linkedin) OVERVIEW: The digital marketing project gives you the ability to manage and analyze your marketing data from different platforms such as Google Analytic, Gmail, Eventbrite, and Google Ad. You can find your emails based on their sent status, campaign, and […]
Interview with Shaohua Zhang, Data Scientist and CEO of WeCloudData – by Reena Shaw

This is a repost of Reena Shaw’s interview with Shaouhua Zhang CEO WeCloudData published on Medium. Thanks, Reena (Linkedin Medium) for doing this interview! During my interviews with various data scientists, Shaohua Zhang is someone who struck me as unique for two reasons: 1) his incredible commitment and generosity to share his experience, and 2) […]
Introduction to Machine Learning In Healthcare

Machine learning applications in healthcare was a great hit with the NYC audience. At least 130 enthusiastic attendees joined the Bots and AI Meetup on December 10th, with the crowd extending far to the back of the room.</p></p> agraph=””>Lucy He of Flatiron Health kicked off the night with an examination of machine learning’s impact in medical study cohort selection. […]