Building an End to End Analytics Pipeline Using Einstein Analytics, Kinesis, Spark and Redshift.

October 13, 2020

This blog is posted by WeCloudData’s student Sneha Mehrin.

If you are a computer programmer or work in any tech-related industry, chances are that you search Google for answers on Stack Overflow at least once a day.

Stack Overflow is a question and answer site for professional and enthusiast programmers. The website offers a platform for users to ask and answer questions and, through active participation, to vote questions and answers up or down.

This series aims to provide a comprehensive view of designing and developing an analytics/AI data pipeline for Stack Overflow using the AWS stack, and finally building a dashboard in Einstein Analytics.

Pipelines are the heart of analytics and ML and quite often this is the hardest part of an analytics or ML problem. If you have a well-designed pipeline, then half your battle is over.

Since this is going to be a long post, I will cover it in 6 different articles. Feel free to jump to any article that piques your interest.

So let’s dive straight into it!

Key Steps in any Project Pipeline

[Figure: project pipeline]

Understanding Business Requirement

The first step in designing an analytics or data science project is to understand how it can deliver value to the end users.

There are two ways we can understand this:

So Then Who Might Be the Stack Overflow Users?

[Figure: Stack Overflow users]

Let’s Understand Our Users in a bit more Detail!!

Understanding our users is critical in gathering business requirements and UX plays a key role here. Any well-designed pipeline is useless if it doesn’t satisfy the needs of the user.

[Image: why user experience is important]

Creating user personas is one way to help guide the ideation process and understand the needs, expectations and behaviour of different users.

Personally, I have found user research and personas to be very effective in designing dashboards and a huge lifesaver in terms of time and efficiency.

So let’s look at the personas developed after doing some mock user research.

I want to focus on the internal users here, because most likely they will be the ones taking advantage of the dashboards.

However, if your pipelines are well designed, they can be scaled and reused for other use cases, such as an ML problem.

1. UX Persona for an Internal User

 

Photo Courtesy: ThriveGlobal.com, https://www.interaction-design.org/

2. UX Persona of a Developer

[Figure: UX persona for a developer]

Key Takeaways from UX Research

A well-designed pipeline can also bring all the required data into a centralised repository, which can then be used for highly interactive visualisation.

Automatic prediction of tags can be a great way to minimise user input. This is an ML use case, and if our pipelines are well designed, they can definitely be used for this purpose.
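To make the tag-prediction idea a little more concrete, here is a minimal sketch of a word-frequency baseline in plain Python. It is only an illustration under stated assumptions, not the model used in this series: the training titles, the tags, and the helper names (tokenize, train, predict_tags) are all hypothetical, and a real solution would train a proper classifier on the data that the pipeline collects.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word-like tokens."""
    return re.findall(r"[a-z0-9+#]+", text.lower())


def train(examples):
    """Count how often each token appears in titles that carry each tag."""
    counts = defaultdict(Counter)
    for title, tags in examples:
        tokens = tokenize(title)
        for tag in tags:
            counts[tag].update(tokens)
    return counts


def predict_tags(counts, title, top_n=2):
    """Rank tags by the share of their training tokens that occur in the title."""
    tokens = tokenize(title)
    scores = {}
    for tag, token_counts in counts.items():
        total = sum(token_counts.values())
        if total == 0:
            continue
        score = sum(token_counts[t] for t in tokens) / total
        if score > 0:
            scores[tag] = score
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


# Hypothetical training data: (question title, tags) pairs.
EXAMPLES = [
    ("How do I merge two dataframes in pandas", ["python", "pandas"]),
    ("Group by and aggregate in a pandas dataframe", ["python", "pandas"]),
    ("Join two tables in a SQL query", ["sql"]),
    ("SQL query to select duplicate rows", ["sql"]),
]

model = train(EXAMPLES)
suggested = predict_tags(model, "Select duplicate rows with SQL")
print(suggested)
```

The point of the sketch is that the question text alone can be enough to suggest tags, so the user only has to confirm or correct a suggestion instead of typing tags from scratch.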

Summary Of Our Business Requirements

Now that we have our two personas and their pain points addressed, let us capture this in the form of a user story.

Now, let’s understand how to conceive a technical architecture for this business requirement.

This is explained in this article!

To find out more about the courses our students have taken to complete these projects and what you can learn from WeCloudData, view the learning path. To read more posts from Sneha, check out her Medium posts here.
