Cluster Computing for Web-Scale Data Processing

被引:0
|
作者
Kimball, Aaron [1 ]
Michels-Slettvet, Sierra [1 ]
Bisciglia, Christophe
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
关键词
Education; Hadoop; MapReduce; Clusters; Distributed computing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present the design of a modem course in cluster computing and large-scale data processing. The defining differences between this and previously published designs are its focus on processing very large data sets and its use of Hadoop, an open source Java-based implementation of MapReduce and the Google File System as the platform for programming exercises. Hadoop proved to be a key element for successfully implementing structured lab activities and independent design projects. Through this course, offered at the University of Washington in 2007, we imparted new skills on our students, improving their ability to design systems capable of solving web-scale problems.
引用
收藏
页码:116 / 120
页数:5
相关论文
共 50 条
  • [1] Web-Scale Multimedia Processing and Applications
    Chang, Edward
    Chang, Shih-Fu
    Hauptmann, Alexander G.
    Huang, Thomas S.
    Slaney, Malcolm
    PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2580 - 2583
  • [2] Web-scale semantic information processing
    Heflin, Jeff
    Stuckenschmidt, Heiner
    JOURNAL OF WEB SEMANTICS, 2012, 10 : 1 - 2
  • [3] Web-Scale Extraction of Structured Data
    Cafarella, Michael J.
    Madhavan, Jayant
    Halevy, Alon
    SIGMOD RECORD, 2008, 37 (04) : 55 - 61
  • [4] Web-Scale Classification: Web Classification in the Big Data Era
    Partalas, Ioannis
    Amini, Massih-Reza
    Androutsopoulos, Ion
    Artieres, Thierry
    Gallinari, Patrick
    Gaussier, Eric
    Paliouras, Georgios
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 687 - 688
  • [5] Creating voiD descriptions for Web-scale data
    Boehm, Christoph
    Lorey, Johannes
    Naumann, Felix
    JOURNAL OF WEB SEMANTICS, 2011, 9 (03): : 339 - 345
  • [6] Building web-scale data mining infrastructure for search
    Ma, Wei-Ying
    PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 9 - 9
  • [7] Web-Scale Datacenters
    Douglis, Fred
    IEEE INTERNET COMPUTING, 2014, 18 (04) : 13 - 14
  • [8] PNUTS in Flight: Web-Scale Data Serving at Yahoo
    Silberstein, Adam
    Chen, Jianjun
    Lomax, David
    McMillen, Brad
    Mortazavi, Masood
    Narayan, P. P. S.
    Ramakrishnan, Raghu
    Sears, Russell
    IEEE INTERNET COMPUTING, 2012, 16 (01) : 13 - 23
  • [9] Web-scale multimedia data management: Challenges and remedies
    Chang, Edward Y.
    14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING WORKSHOPS, PROCEEDINGS, 2007, : 3 - 8
  • [10] Sentence Completion Task using Web-scale Data
    Lee, Kyusong
    Lee, Gary Geunbae
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 173 - 176