	{"id":3679,"date":"2013-10-09T09:50:15","date_gmt":"2013-10-09T02:50:15","guid":{"rendered":"http:\/\/science-technology.vn\/?p=3679"},"modified":"2013-10-31T10:30:01","modified_gmt":"2013-10-31T03:30:01","slug":"ki-nang-big-data","status":"publish","type":"post","link":"https:\/\/science-technology.vn\/?p=3679","title":{"rendered":"K\u0129 n\u0103ng Big Data"},"content":{"rendered":"<p><span style=\"font-size: 14px; line-height: 1.428571429;\">M\u1ed9t sinh vi\u00ean h\u1ecfi t\u00f4i: \u201cEm c\u1ea7n k\u0129 n\u0103ng n\u00e0o \u0111\u1ec3 l\u00e0m vi\u1ec7c trong khu v\u1ef1c Big Data?\u201d \u201cEm c\u00f3 th\u1ec3 h\u1ecdc nh\u1eefng k\u0129 n\u0103ng n\u00e0y \u1edf \u0111\u00e2u?\u201d Xin th\u1ea7y l\u1eddi khuy\u00ean.&#8221;<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>\u0110\u00e1p: Big Data l\u00e0 khu v\u1ef1c \u0111ang n\u1ed5i l\u00ean trong c\u00f4ng ngh\u1ec7 th\u00f4ng tin (CNTT) gi\u1ea3i quy\u1ebft v\u1edbi vi\u1ec7c x\u00e2y d\u1ef1ng &#8220;s\u1ea3n ph\u1ea9m d\u1eef li\u1ec7u&#8221; d\u1ef1a tr\u00ean c\u00e1c thu\u1eadt to\u00e1n ph\u1ee9c t\u1ea1p. N\u00f3 l\u00e0 t\u1ed5 h\u1ee3p c\u1ee7a c\u00e1c khu v\u1ef1c c\u00f4ng ngh\u1ec7 t\u00ednh to\u00e1n, to\u00e1n h\u1ecdc, v\u00e0 qu\u1ea3n l\u00ed d\u1eef li\u1ec7u. K\u0129 n\u0103ng Big Data th\u01b0\u1eddng \u0111\u01b0\u1ee3c d\u1ea1y trong ch\u01b0\u01a1ng tr\u00ecnh b\u1eb1ng th\u1ea1c s\u0129 (th\u1ea1c s\u0129 trong khoa h\u1ecdc m\u00e1y t\u00ednh chuy\u00ean m\u00f4n ho\u00e1 trong Big Data hay th\u1ea1c s\u0129 trong c\u00f4ng ngh\u1ec7 th\u00f4ng tin trong ph\u00e2n t\u00edch d\u1eef li\u1ec7u v.v).<\/p>\n<p>L\u00e0 b\u1eb1ng th\u1ea1c s\u0129, n\u00f3 y\u00eau c\u1ea7u r\u1eb1ng b\u1ea1n ph\u1ea3i c\u00f3 b\u1eb1ng c\u1eed nh\u00e2n trong khoa h\u1ecdc m\u00e1y t\u00ednh, k\u0129 ngh\u1ec7 ph\u1ea7n m\u1ec1m hay qu\u1ea3n l\u00ed h\u1ec7 th\u00f4ng tin \u0111\u1ec3 xin v\u00e0o. \u0110i\u1ec1u \u0111\u00f3 c\u0169ng c\u00f3 ngh\u0129a l\u00e0 b\u1ea1n ph\u1ea3i c\u00f3 k\u0129 n\u0103ng l\u1eadp tr\u00ecnh m\u1ea1nh trong Java, C++ hay Python, c\u00f3 tri th\u1ee9c t\u1ed1t v\u1ec1 c\u1ea5u tr\u00fac d\u1eef li\u1ec7u v\u00e0 thu\u1eadt to\u00e1n, v\u00e0 hi\u1ec3u v\u00f2ng \u0111\u1eddi ph\u00e1t tri\u1ec3n ph\u1ea7n m\u1ec1m, \u0111\u1eb7c bi\u1ec7t cho ph\u1ea7n m\u1ec1m th\u1ef1c hi\u1ec7n c\u00e1c nhi\u1ec7m v\u1ee5 ph\u1ee9c t\u1ea1p.<\/p>\n<p>Trong ch\u01b0\u01a1ng tr\u00ecnh n\u00e0y b\u1ea1n s\u1ebd h\u1ecdc v\u00e0i m\u00f4n trong tr\u00ed tu\u1ec7 nh\u00e2n t\u1ea1o (AI) nh\u01b0 H\u1ecdc m\u00e1y v\u00e0 th\u1ed1ng k\u00ea \u0111\u1ec3 ph\u00e1t tri\u1ec3n c\u00e1c thu\u1eadt to\u00e1n gi\u1ea3i quy\u1ebft v\u1edbi t\u1eadp d\u1eef li\u1ec7u l\u1edbn. B\u1ea1n s\u1ebd h\u1ecdc v\u1ec1 v\u00e0i thu\u1eadt to\u00e1n \u0111\u01b0\u1ee3c d\u00f9ng trong h\u1ecdc m\u00e1y, ch\u00fang gi\u1ea3i quy\u1ebft c\u00e1c v\u1ea5n \u0111\u1ec1 n\u00e0o, v\u00e0 ch\u00fang \u0111\u01b0\u1ee3c th\u1ef1c hi\u1ec7n th\u1ebf n\u00e0o. (t\u1ee9c l\u00e0, \u0111\u1ed9ng c\u01a1 khuy\u1ebfn c\u00e1o, c\u00e2y quy\u1ebft \u0111\u1ecbnh, x\u1eed l\u00ed ng\u00f4n ng\u1eef t\u1ef1 nhi\u00ean v.v.) B\u1ea1n c\u0169ng h\u1ecdc v\u00e0i m\u00f4n trong c\u00f4ng c\u1ee5 m\u00f4 h\u00ecnh ho\u00e1 nh\u01b0 R hay Matlab hay SAS. C\u00e1c c\u00f4ng c\u1ee5 ph\u00e2n t\u00edch th\u1ed1ng k\u00ea v\u00e0 tr\u1ef1c quan ho\u00e1 l\u00e0 r\u1ea5t quan tr\u1ecdng trong c\u00f4ng vi\u1ec7c Big Data \u0111\u1ec3 th\u1ef1c hi\u1ec7n ph\u00e2n t\u00edch h\u1ed3i qui, ph\u00e2n t\u00edch k\u1ebft c\u1ee5m, v\u00e0 ph\u00e2n l\u1edbp d\u1eef li\u1ec7u.<\/p>\n<p>\u0110\u1ec3 gi\u1ea3i quy\u1ebft v\u1edbi t\u1eadp d\u1eef li\u1ec7u l\u1edbn, b\u1ea1n c\u0169ng c\u1ea7n h\u1ecdc c\u00e1c m\u00f4n v\u1ec1 Hadoop, MapReduce, NonSQL, Pig v\u00e0 Hive v\u00e0 Mahout.<\/p>\n<p>Big Data l\u00e0 m\u1edbi v\u00e0 v\u1eabn \u0111ang ti\u1ebfn ho\u00e1. B\u1ea1n ph\u1ea3i h\u1ecdc nhi\u1ec1u c\u00e1c k\u0129 n\u0103ng k\u0129 thu\u1eadt v\u00e0 \u0111\u01b0a v\u00e0o th\u1ef1c h\u00e0nh \u0111\u1ec3 thu l\u1ea5y kinh nghi\u1ec7m. Do \u0111\u00f3 ph\u00e1t tri\u1ec3n c\u00e1c k\u0129 n\u0103ng Big Data, s\u1ebd c\u1ea7n th\u1eddi gian, c\u00f4ng s\u1ee9c \u0111\u1ec3 l\u00e0m vi\u1ec7c nh\u01b0 chuy\u00ean vi\u00ean d\u1eef li\u1ec7u hay nh\u00e0 khoa h\u1ecdc d\u1eef li\u1ec7u. Ng\u00e0y nay Big Data l\u00e0 m\u1ed9t trong nh\u1eefng khu v\u1ef1c c\u00f3 nhu c\u1ea7u cao v\u1edbi tr\u1ea3 l\u01b0\u01a1ng cao trong c\u00f4ng nghi\u1ec7p b\u1edfi v\u00ec c\u00f3 thi\u1ebfu h\u1ee5t tr\u1ea7m tr\u1ecdng v\u1ec1 nh\u1eefng k\u0129 n\u0103ng n\u00e0y.<\/p>\n<p>&nbsp;<\/p>\n<p>&#8212;English version&#8212;<\/p>\n<p>&nbsp;<\/p>\n<p>Big Data skills<\/p>\n<p>A student asked me: \u201cWhat skills do I need to work in the Big Data area?\u201d \u201cWhere can I learn these skills?\u201d Please advice.<\/p>\n<p>&nbsp;<\/p>\n<p>Answer: Big Data is an emerging area of Information Technology (IT) that deals with building \u201cdata products\u201d based on complex algorithms. It is a combination of computing technology, mathematics, and data management areas. Big Data Skills are often taught in a Master\u2019s degree program (Master in Computer Science that specialize in Big Data or Master in Information Technology in Data Analytics etc.).<\/p>\n<p>As a Master\u2019s degree, it requires that you have a Bachelor\u2019s degree in Computer Science, Software Engineering or Information System Management to apply. That also means that you must have a strong programming skills in Java, C++ or Python, have good knowledge of data structures and algorithms, and understand software development lifecycle, especially for software that perform complex tasks.<\/p>\n<p>In this program you will take a few courses in Artificial Intelligence (AI) such as Machine Learning and Statistics to develop algorithms that deal with large datasets. You will learn about several algorithms used in machine learning, which problems they solve, and how they are implemented. (i.e., recommendation engines, decision trees, Natural Language Processing etc.) You also take several courses in modeling tools such as R or Matlab or SAS. These statistical analysis and visualization tools are very important in Big Data works to perform regression analysis, clustering analysis, and data classification.<\/p>\n<p>To deal with large datasets, you also need to take courses to learn about Hadoop, MapReduce, NonSQL, Pig and Hive and Mahout.<\/p>\n<p>Big Data is new and is still evolving. You have to learn a lot of technical skills and put into practice to gain experience. Therefore to develop Big Data skills, it will take time, effort to work as a Data Specialist or Data Scientist. Today Big Data is one of the area that has highest demand with the highest paid in the industry because there is a critical shortage of these skills.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>M\u1ed9t sinh vi\u00ean h\u1ecfi t\u00f4i: \u201cEm c\u1ea7n k\u0129 n\u0103ng n\u00e0o \u0111\u1ec3 l\u00e0m vi\u1ec7c trong khu v\u1ef1c Big Data?\u201d \u201cEm c\u00f3 th\u1ec3 h\u1ecdc nh\u1eefng k\u0129 n\u0103ng &hellip; <\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[30,36],"tags":[],"class_list":["post-3679","post","type-post","status-publish","format-standard","hentry","category-hoi-va-dap","category-social-media-mobility-big-data-analytics-and-cloud-computing"],"_links":{"self":[{"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/posts\/3679","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/science-technology.vn\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=3679"}],"version-history":[{"count":3,"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/posts\/3679\/revisions"}],"predecessor-version":[{"id":3681,"href":"https:\/\/science-technology.vn\/index.php?rest_route=\/wp\/v2\/posts\/3679\/revisions\/3681"}],"wp:attachment":[{"href":"https:\/\/science-technology.vn\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=3679"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/science-technology.vn\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=3679"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/science-technology.vn\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=3679"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}