データ科学生態系パート 2：データの論争|The data science ecosystem part 2: Data wrangling

オープンデータ関連のニュースです。

http://www.computerworld.com/article/2902920/the-data-science-ecosystem-part-2-data-wrangling.html

データ科学生態系パート 2：データの論争|The data science ecosystem part 2: Data wrangling

日本語

Michael Cavaretta、データの科学者フォード・モーター、ニューヨーク・タイムズの最近の記事からのお金の引用があった。部分は、データの科学者が直面する課題、日常の業務についての予定についてだった。Cavaretta は言った：「我々本当に必要より良いツールデータ論争に少ない時間を過ごすことができますので、セクシーなものを得る.」論争のデータは、データのクリーンアップ、ツールを接続し、; 使用可能な形式にデータを取得セクシーなものは、予測分析、モデリングです。最初の時と呼びます「用務員の仕事」を考えると、1 つは、もう少し楽しくを推測できます。CrowdFlower の最近の調査ではデータの科学者過ごした時間論争データの固体の 80 ％を発見します。
続きを読む…

English

There was a money quote from Michael Cavaretta, a data scientist at Ford Motor, in a recent article in The New York Times. The piece was about the challenges data scientists face going about their daily business. Cavaretta said: “We really need better tools so we can spend less time on data wrangling and get to the sexy stuff.” Data wrangling is cleaning data, connecting tools and getting data into a usable format; the sexy stuff is predictive analysis and modeling. Considering that the first is sometimes referred to as “janitor work,” you can guess which one is a bit more enjoyable. In CrowdFlower’s recent survey, we found that data scientists spent a solid 80% of their time wrangling data.
Read more…

オープンデータとプログラミング

オープンデータの利活用を考えるビジネス情報サイト

Open Data

データ科学生態系パート 2：データの論争|The data science ecosystem part 2: Data wrangling

by opendata • 2015年4月2日

日本語

English

Open Data

データ科学生態系パート 2： データの論争|The data science ecosystem part 2: Data wrangling

by opendata • 2015年4月2日

日本語

English

Post navigation

データ科学生態系パート 2：データの論争|The data science ecosystem part 2: Data wrangling