본문으로 바로가기

강의

Python으로 배우는 사기 탐지

중급기술 수준

업데이트됨 2024. 8.

파이썬을 사용하여 사기를 탐지하는 방법을 배우세요.

무료로 강의 시작

PythonMachine Learning

4시간

16 동영상

57 연습 문제

4,800 XP

22,044

성취 증명서

수천 개 기업의 학습자들이 사랑하는

팀을 교육하시나요?

비즈니스용으로 체험해 보세요

강의 설명

일반적인 조직은 매년 매출의 약 5%를 사기로 잃는 것으로 추정됩니다. 이 강의에서는 데이터를 활용해 사기를 대응하는 방법을 배웁니다. 예를 들어, 과거와 유사한 사기 행위를 탐지하기 위해 지도 학습 알고리즘을 적용하는 법과, 새로운 유형의 사기 활동을 발견하기 위한 비지도 학습 방법을 학습합니다. 또한 사기 분석에서는 사기와 비사기를 분류할 때 극도로 불균형한 데이터셋을 자주 다루게 되며, 강의를 통해 이를 처리하는 기법들도 익히게 됩니다. 이 강의는 기술적 내용과 이론적 인사이트를 함께 제공하며, 사기 탐지 모델을 실무에서 구현하는 방법을 직접 다뤄 봅니다. 더불어 실제 경험에서 비롯된 팁과 조언을 제공해, 사기 분석에서 흔히 하는 실수를 피할 수 있도록 도와드립니다.

선수 조건

Unsupervised Learning in Python Supervised Learning with scikit-learn

1

Introduction and preparing your data

In this chapter, you'll learn about the typical challenges associated with fraud detection, and will learn how to resample your data in a smart way, to tackle problems with imbalanced data.

Introduction to fraud detection

Checking the fraud to non-fraud ratio

Plotting your data

Increasing successful detections using data resampling

Resampling methods for imbalanced data

Applying SMOTE

Compare SMOTE to original data

Fraud detection algorithms in action

Exploring the traditional way to catch fraud

Using ML classification to catch fraud

Logistic regression combined with SMOTE

Using a pipeline

2

Fraud detection using labeled data

Now that you're familiar with the main challenges of fraud detection, you're about to learn how to flag fraudulent transactions with supervised learning. You will use classifiers, adjust them, and compare them to find the most efficient fraud detection model.

Review of classification methods

Natural hit rate

Random Forest Classifier - part 1

Random Forest Classifier - part 2

Performance evaluation

Performance metrics for the RF model

Plotting the Precision Recall Curve

Adjusting your algorithm weights

Model adjustments

Adjusting your Random Forest to fraud detection

GridSearchCV to find optimal parameters

Model results using GridSearchCV

Ensemble methods

Logistic Regression

Voting Classifier

Adjust weights within the Voting Classifier

3

Fraud detection using unlabeled data

This chapter focuses on using unsupervised learning techniques to detect fraud. You will segment customers, use K-means clustering and other clustering algorithms to find suspicious occurrences in your data.

Normal versus abnormal behavior

Exploring your data

Customer segmentation

Using statistics to define normal behavior

Clustering methods to detect fraud

Scaling the data

K-means clustering

Elbow method

Assigning fraud versus non-fraud

Detecting outliers

Checking model results

Other clustering fraud detection methods

Assessing smallest clusters

Checking results

4

Fraud detection using text

In this final chapter, you will use text data, text mining, and topic modeling to detect fraudulent behavior.

Using text data

Word search with dataframes

Using list of terms

Creating a flag

Text mining to detect fraud

Removing stopwords

Cleaning text data

Topic modeling on fraud

Create dictionary and corpus

Flagging fraud based on topics

Interpreting the topic model

Finding fraudsters based on topic

Python으로 배우는 사기 탐지

강의
완료

수료증 획득

LinkedIn 프로필, 이력서 또는 CV에 이 인증서를 추가하세요
소셜 미디어와 성과 평가에서 공유하세요지금 등록

19백만 명 이상의 학습자와 함께 Python으로 배우는 사기 탐지을(를) 시작하세요!

DataCamp for Mobile을 통해 데이터 분석 능력을 향상시키세요.

모바일 강좌와 매일 5분 코딩 챌린지를 통해 이동 중에도 학습 효과를 높이세요.