官术网_书友最值得收藏!

Defining the Problem Statement

If you recollect the data we explored in Chapter 1, R for Advanced Analytics, bank marketing data, we have a dataset that captures the telemarketing campaigns conducted by a bank to attract customers.

A large multinational bank is designing a marketing campaign to achieve its growth target by enticing customers for bank deposits. The campaign has been ineffective in luring customers, and the marketing team wants to understand how the campaign can be improved to achieve the growth targets.

We can reframe the problem from the business stakeholders' perspective and try to see what kind of solution would best fit here.

Problem-Designing Artifacts

Just like there are several frameworks, templates, and artifacts for software engineering and other industrial projects, data science and business analytics projects can also be effectively represented using industry standard artifacts. Some popular choices are available from consulting giants such as McKinsey, BCG, and decision sciences giants such as Mu Sigma. We will use a popular framework based on the Minto Pyramid principle called Situation - Complication -Question Analysis (SCQ).

Let's try defining the problem statement in the following construct:

  • Situation: Define the current situation. We can simplify this by answering the question—what happened?

    A large multinational bank is designing a marketing campaign to achieve its growth target by enticing customers for bank deposits. The campaign has been ineffective in luring customers, and the marketing team wants to understand how the campaign can be improved to achieve the growth targets.

In the previous section, we saw a hypothetical business problem framed for the banking data's use case. Though this might be different in reality, we are definitely trying to solve a valid use case. By representing the problem statement in the format demonstrated as in the previous format, we have a clear area to focus on and solve. This solves the first step in the life cycle of a typical data science use case. The second step is data gathering, which we explored in the previous chapter. We will refer to the same dataset provided by UCI machine learning repository at https://archive.ics.uci.edu/ml/datasets/Bank%20Marketing.

Note

[Moro et al., 2014] S. Moro, P. Cortez, and P. Rita. A Data-Driven Approach to Predict the Success of Bank Telemarketing. Decision Support Systems, Elsevier, 62:22-31, June 2014.

This brings us to the final step: EDA. In this use case, we want to understand the various factors that are leading to the poor performance of the campaign. Before we delve into the actual exercise, let's take a moment to understand the concept of EDA in a more intuitive way.

主站蜘蛛池模板: 西乡县| 葫芦岛市| 璧山县| 拉萨市| 常熟市| 边坝县| 绍兴市| 衡南县| 合作市| 平阳县| 靖安县| 孙吴县| 武城县| 鄱阳县| 鹿邑县| 子洲县| 佛坪县| 剑河县| 游戏| 清原| 山阳县| 大英县| 原阳县| 浦县| 铅山县| 望奎县| 赤壁市| 高密市| 浙江省| 浠水县| 盘锦市| 赤城县| 天全县| 乐清市| 禄丰县| 龙川县| 洮南市| 千阳县| 宁津县| 富宁县| 姜堰市|