HOME / ¹®¼¸¶ÄÏ / ¸®Æ÷Æ® / ±â¼ú°øÇаè¿
(¿µ¹®) ¾Ö³Î¸®Æ½½º ºòµ¥ÀÌÅÍ ÆǸŠ¿¹Ãø¸ðµ¨ ÇÁ·ÎÁ§Æ® ÀÚ·áÀÔ´Ï´Ù
Àüü 9 page Áß 9 page±îÁö ¹Ì¸®º¸±â°¡ °¡´ÉÇÕ´Ï´Ù.
¼Ò°³±Û
(¿µ¹®) ¾Ö³Î¸®Æ½½º ºòµ¥ÀÌÅÍ ÆǸŠ¿¹Ãø¸ðµ¨ ÇÁ·ÎÁ§Æ®¿¡ ´ëÇØ ±â¼úÇÑ ¸®Æ÷Æ® Âü°íÀÚ·áÀÔ´Ï´Ù.
ÀúÀ۽ñâ : 2021³â 1¿ù
º»¹®³»¿ë
PROJECT DELIVERABLE #1
Source of the Data
Our team obtained the data fr
om an educational analytics website called, "Analytics Vidhya." According to the website, the data was collected by the data scientists at BigMart. The data is comprised of BigMart's 2013 sales data for 1559 products across 10 stores in different cities.
The link to the website is as
follows:
https://datahack.analyticsvid
hya.com/contest/practice-problem-big-mart-sales-iii/#data_dictionary
Description of the Data
There are two csv data files:
1. Test, and 2. Train.
"Train" is the main data set
our team will work on to build a predictive model.
We use "Test" data set to app
ly our prediction model once it is built.
1. Size of the Data
1. Train data set: 8,523 rows
, 12 columns
2. Test data set: 5,681 rows,
11 columns
The data set looks like the f
ollowing figure.
Variables Description
Item_Identifier
Unique product ID
Item_Weight
Weight of product
Item_Fat_Content
Whether the product is low fa
t or not (ÀÌÇÏ »ý·«)
Âü°í¹®Çå
¾øÀ½
¹ÞÀº º°Á¡
0/5
0°³ÀÇ º°Á¡
¹®¼°øÀ¯ ÀڷḦ µî·ÏÇØ ÁÖ¼¼¿ä.
¹®¼°øÀ¯ Æ÷ÀÎÆ®¿Í Çö±ÝÀ» µå¸³´Ï´Ù.
Æ÷ÀÎÆ® : ÀÚ·á 1°Ç´ç ÃÖ´ë 5,000P Áö±Þ
Çö±Ý : ÀÚ·á 1°Ç´ç ÃÖ´ë 2,000¿ø Áö±Þ
ÀúÀÛ±Ç Á¤º¸
À§ Á¤º¸ ¹× °Ô½Ã¹° ³»¿ëÀÇ Áø½Ç¼º¿¡ ´ëÇÏ¿© ¾ÅÅ©Á¸Àº º¸ÁõÇÏÁö ¾Æ´ÏÇϸç, ÇØ´ç Á¤º¸ ¹× °Ô½Ã¹° ÀúÀ۱ǰú ±âŸ ¹ýÀû Ã¥ÀÓÀº ÀÚ·á µî·ÏÀÚ¿¡°Ô ÀÖ½À´Ï´Ù.
À§ Á¤º¸ ¹× °Ô½Ã¹° ³»¿ëÀÇ ºÒ¹ýÀû ÀÌ¿ë, ¹«´Ü ÀüÀ硤¹èÆ÷´Â ±ÝÁöµÇ¾î ÀÖ½À´Ï´Ù.
ÀúÀÛ±ÇħÇØ, ¸í¿¹ÈÑ¼Õ µî ºÐÀï¿ä¼Ò ¹ß°ß½Ã °í°´¼¾ÅÍÀÇ ÀúÀÛ±Ç Ä§ÇØ½Å°í¸¦ ÀÌ¿ëÇØ Áֽñ⠹ٶø´Ï´Ù.