Big Data Processing (Spark) as Told by a Silicon Valley Leader

Name: Big Data Processing (Spark) as Told by a Silicon Valley Leader
Price: 108900 KRW

How is processing big data different from processing data with Pandas? Let's learn about Spark, an essential framework for big data processing.

(5.0) 2 reviews

39 students

keeyonghan9539

Apache Spark

pyspark

Pandas

Big Data

SQL

What you will learn!

Spark
Big data processing
Databricks
Spark SQL
Data Engineering

A course covering Spark performance optimization and practical logic implementation .

This course will help you implement a variety of scenarios, including user behavior analysis, channel flow analysis, and sales aggregation.
Learn the core strategies for designing and utilizing Spark quickly and flexibly. You will also learn Partition, Shuffling, Join methods, and advanced features such as Parquet, UDF, and UDAF in a step-by-step manner, so you can naturally develop the performance optimization and complex logic implementation capabilities required for practical work .

Hello. I am Ki-Yong Han, a data expert in Silicon Valley with 30 years of experience. After starting my career at Samsung Electronics, I moved to Silicon Valley at the age of 31 and spent the first 11 years developing web search at Yahoo, where I first encountered big data processing. Since then, I have built data teams at organizations such as Udemy (listed on NASDAQ in 2021) and Polyvore (acquired by Yahoo in 2015), and have provided data consulting to various Silicon Valley and Korean companies . Based on this, I will share essential skills for data engineers based on my experience teaching master's students at San Jose State University, which boasts the highest employment rate in Silicon Valley .

Recommended for
these people!

Who is this course right for?

Someone who is basically interested in big data processing
Someone with Pandas experience who wants to expand into big data processing technologies.
Someone who needs big data processing for their work.

Need to know before starting?

Experience using Pandas
Basic Python
Basic SQL

Hello
This is keeyonghan9539

794

Students

Reviews

Answers

4.9

Rating

Courses

컴퓨터 공학 석사 후 삼성전자에서 시작된 커리어가 친구덕에 실리콘밸리로 이어져 지난 29년간 13개의 다양한 스테이지의 회사를 다녔습니다 (창업, 대기업들, 다수의 스타트업들).

야후: 엔지니어링 디렉터로 검색엔진 개발.
유데미. 데이터팀을 처음 만들어 30명까지 성장. 2021년 10월에 나스닥 상장
삼성전자
...

중간에 11개월 쉬어보기도 했고 본의 아니게 엔젤투자자(Chartmetric, Goodtime.io, Select Star, EO, 비지니스 캔버스, ...), 어드바이저(몰로코, 블라인드, 월급쟁이부자들, ...), 컨설팅(SK텔레콤, 현대카드, 이마트 등등) 등의 역할을 하면서 나만의 브랜드를 만들었습니다. 실패를 실패가 아닌 교훈으로 보는 긍정의 힘과 꾸준함이라는 복리의 힘을 믿습니다.

https://www.linkedin.com/in/keeyonghan/

유투브 채널

월급쟁이부자들 강의

Curriculum

All

45 lectures ∙ (11hr 25min)

Course Materials:

Lecture resources

Section 1. Course Introduction

3 lectures ∙ (14min)

1. Instructor Info
05:29
2. Course Introduction
08:31
3. Lecture materials

Section 2. From Small Data to Big Data

5 lectures ∙ (1hr 2min)

4. Evolution of Data Systems: Small -> Big
17:07
5. Big Data Definition and Examples
06:52
6. Big Data Processing Features
05:31
7. The advent and introduction of Hadoop
17:33
8. Problems and Alternatives of MapReduce Programming
15:09

Section 3. From Pandas to Spark!

8 lectures ∙ (2hr 39min)

9. Pandas Introduction
09:03
10. Comparison of Pandas and Spark through Examples
26:37
11. Spark Introduction
10:34
12. Introduction to Spark Architecture and Practice Environment
19:51
13. Spark Program Structure
24:33
14. Spark Program Structure Practice
21:09
15. Performance Comparison of Pandas and Spark Using Examples
17:53
16. Performance Comparison of Pandas and Spark through Examples (Hands-on)
29:39

Section 4. Spark Programming

12 lectures ∙ (3hr 27min)

Section 5. Practice and Extension

15 lectures ∙ (3hr 53min)

Section 6. Wrap-up and Next Steps

2 lectures ∙ (9min)

Published: 04/20/2025

Last updated: 05/30/2025

Reviews

All

2 reviews

5.0

2 reviews

diazepam57
Reviews 8
∙
Average Rating 5.0
07/02/2025
5
60% enrolled
everythx
Reviews 10
∙
Average Rating 5.0
05/23/2025
5
32% enrolled
고스펙의 실무와 대학강의를 겸비하셔서인지 이해가 쉽게됩니다

keeyonghan9539's other courses

Check out other courses by the instructor!

실리콘밸리 데이터 리더가 알려주는 Airflow 기초

한기용

$132,000.00

Basic / airflow, snowflake, SQL, Python

5.0

(4)

100+

AI 시대가 도래하면서, 데이터 파이프라인 구성은 기업 경쟁력을 좌우하는 핵심 역량으로 자리 잡았습니다. 가장 널리 사용되는 Airflow를 활용해 효율적인 데이터 파이프라인을 구축하는 노하우를, 실전 경험과 풍부한 강의 경력을 지닌 실리콘밸리 전문가(前 유데미 데이터팀 헤드, 現 산호세 주립대 데이터 석사 과정 교수)에게 직접 배워보세요.

Basic

airflow, snowflake, SQL

실리콘밸리 데이터 리더가 알려주는 기초 SQL

한기용

$71,500.00

Basic / SQL, 데이터 리터러시, 데이터 엔지니어링, 빅데이터, DBMS/RDBMS, duckdb

5.0

(2)

데이터를 하는 사람이라면 꼭 알아야하는 기본 기술은 SQL입니다. 이번 강의에서는 SQL을 데이터 분석이란 관점에서 실습 위주로 학습해보겠습니다. 실습은 DuckDB를 가지고 Google Colab에서 진행합니다.

Basic

SQL, 데이터 리터러시, 데이터 엔지니어링

[멘토링] 데이터로 미래를 그리다: 모두를 위한 데이터 리터러시

한기용

$264,000.00

Beginner / 데이터 리터러시, 데이터 엔지니어링, 데이터 트랜스포메이션, EDA

4.8

(11)

Update

Mentoring

데이터에 관심있는 개인이나 리더를 대상으로 데이터 팀이 하는 일을 소개하고 조직의 데이터 활용 능력을 나타내는 데이터 문해력이 어떤 것인지 소개합니다.

Beginner

데이터 리터러시, 데이터 엔지니어링, 데이터 트랜스포메이션

Similar courses

Explore other courses in the same field!

Data Engineering Course (1) : 빅데이터 하둡 직접 설치하기

Billy Lee

$55,000.00

Basic / 빅데이터, Hadoop, 데이터 엔지니어링

4.6

(36)

500+

하둡과 빅데이터를 배우고자 하는 수강생들은 이 과정을 통해 빅데이터 세계를 경험하는 놀라운 발전을 기념할 것입니다!

Basic

빅데이터, Hadoop, 데이터 엔지니어링

실리콘밸리 데이터 리더가 알려주는 기초 SQL

한기용

$71,500.00

Basic / SQL, 데이터 리터러시, 데이터 엔지니어링, 빅데이터, DBMS/RDBMS, duckdb

5.0

(2)

Basic

SQL, 데이터 리터러시, 데이터 엔지니어링

업무에 바로 써먹는 데이터 마인드(데이터 리터러시) 향상 방법

한국사회능력개발원

$111,100.00

Beginner / 데이터 리터러시, 빅데이터, 머신러닝, 문제해결능력

New

데이터 분석 경험도 별다른 기술도 없는 기획자, 마케터가 가장 기초적인 수준에서 데이터 분석을 해볼 수 있는 방법을 다양한 사례와 함께 알려 드립니다. 여러 해 동안 100여 개 기업과 공공기관 등에서 2천여 명에 이르는 수강자들과 함께 실습하며 데이터 비전문가 입장에서 가장 현실적으로 활용 가능한 분석법으로 내용을 구성했습니다.

Beginner

데이터 리터러시, 빅데이터, 머신러닝

R을 활용한 빅데이터 및 통계분석

한국사회능력개발원

$129,800.00

Beginner / R, 빅데이터

New

R프로그래밍을 이용해 누구나 빅데이터 분석을 할 수 있도록 데이터의 기본적인 개념, R의 유용한 함수와 패키지, 데이터 분석 실습을 담았습니다.

Beginner

R, 빅데이터

[2025] SQLD 문제가 어려운 당신을 위한 노랭이 176 문제 풀이

데이터코드랩

$39,600.00

Basic / SQL, SQLD, 빅데이터, Oracle, MSSQL

5.0

(5)

공부는 했지만 문제를 풀지 못하는 당신을 위한 SQLD 노랭이 176 문제 풀이 강의. 완강 후 합격을 넘어 전문가가 됩니다. SQLD 올인원패스!

Basic

SQL, SQLD, 빅데이터

데이터입문자를 위한 Azure 데이터 기초 완전정복

이상희강사

$110,000.00

Beginner / SQL, 빅데이터, 데이터 엔지니어링, database, 데이터 리터러시

Microsoft AZ-900 자격을 동시에 대비 할 수 있는 이론적 토대를 마련 할 수 있는 특강이며 2025년 5월 기준의 출제 범위를 반영한 최신 콘텐트로서 핵심 데이터 개념 ,Azure의 관계형 데이터 ,Azure의 비관계형 데이터,Azure의 분석 워크로드에 관련된 내용을 이론과 실습이 겸비된 형태로 제공함으로서 자격증 취득은 물론 데이터 전문가로의 첫걸음 다지는 의미있는 교육 기회로 활용 할 수 있습니다

Beginner

SQL, 빅데이터, 데이터 엔지니어링

모르면 퇴사각? 데이터 엔지니어링 정석

미쿡엔지니어

$110,000.00

Basic / 빅데이터, 데이터 엔지니어링, 아키텍처

5.0

(4)

100+

Active Replies

데이터 시대, 진정한 가치를 발견하라! 📊 데이터에 집중된 어플리케이션 설계는 이제 필수가 되었습니다. 최신 트렌드와 실무 중심의 사례로 회사가 원하는 인사이트와 실력을 키워보세요. 효율적인 데이터 처리와 설계 비법, 지금 바로 시작하세요! 당신의 다음 스텝, 데이터 중심의 세계로 도약하세요!

Basic

빅데이터, 데이터 엔지니어링, 아키텍처

[무료]기초 텍스트마이닝: 앱 리뷰 분석 with 파이썬(40분 완성)

HappyAI

Free

Basic / 텍스트마이닝, 빅데이터, NLP, 데이터 리터러시

4.7

(13)

500+

이 강의는 파이썬을 활용한 텍스트마이닝 분석에 관한 기초 이론과 실습을 배울 수 있습니다. 실무나 논문 작성 시 필요한 기초적인 텍스트마이닝 데이터 분석 기법을 설명합니다.

Basic

텍스트마이닝, 빅데이터, NLP

[관리코스 #3] DE, DBA (SSIS, SSAS, MachineLearning, BI, ETL)

개발자Park

$55,000.00

Basic / 빅데이터, ssis, ssas, 머신러닝, etl

5.0

(2)

Active Replies

SSIS, SSAS, MachineLearning, BI, ETL. 국내의 도서, 유튜브, 강의, 블로그, 학원에서 찾아볼 수 없는 중요한 기술을 배울 수 있습니다. 국내 대기업, 미국 대기업 및 미국 주정부 자금 지원 기관 취업에 관심 있는 분들께도 추천해요.

Basic

빅데이터, ssis, ssas

$84.70

Big Data Processing (Spark) as Told by a Silicon Valley Leader

What you will learn!

Silicon Valley Engineer Explains
Data Pipeline Design Practices

Spark, the standard for large-scale data processing

Why you should take this course

Learn about these things

I recommend this to these people

After class

Insights from Silicon Valley -proven big data experts

Things to note before taking the class

Practice environment

Learning Materials

Player Knowledge and Notes

Recommended for
these people!

Hello
This is keeyonghan9539

Curriculum

Reviews

keeyonghan9539's other courses

Similar courses

Big Data Processing (Spark) as Told by a Silicon Valley Leader

What you will learn!

Silicon Valley Engineer Explains Data Pipeline Design Practices

Spark, the standard for large-scale data processing

Why you should take this course

Learn about these things

I recommend this to these people

After class

Insights from Silicon Valley -proven big data experts

Things to note before taking the class

Practice environment

Learning Materials

Player Knowledge and Notes

Recommended for these people!

HelloThis is keeyonghan9539

Curriculum

Reviews

keeyonghan9539's other courses

Similar courses

Silicon Valley Engineer Explains
Data Pipeline Design Practices

Recommended for
these people!

Hello
This is keeyonghan9539