[AWS EMR Hadoop 실습] HDFS이용하는 Java Application 구현하기

Hyunjun Kim·2025년 8월 14일

Data_Engineering

목록 보기

130/153

5 HDFS이용하는 Java Application 구현하기

5.1 소스코드

settings.gradle

rootProject.name = 'hadoop-hdfs-app'

build.gradle

plugins {
    id 'java'
}

group 'de.example.hadoop.hdfs'
version '1.0-SNAPSHOT'

repositories {
    mavenCentral()
}

dependencies {
    implementation 'org.apache.hadoop:hadoop-common:3.2.1'
    implementation 'org.apache.hadoop:hadoop-hdfs-client:3.2.1'
}

test {
    useJUnitPlatform()
}

de.example.hadoop.hdfs.InputReadAndFileWriter.java

package de.example.hadoop.hdfs;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class InputReadAndFileWriter {

    public static void main(String[] args) {
        if (args.length != 2) {
            System.err.println("Usage: InputReadAndFileWriter <filename> <content>");
            System.exit(1);
        }

        String filePath = args[0];
        String contents = args[1];

        try {
            Configuration configuration = new Configuration();
            FileSystem hdfs = FileSystem.get(configuration);

            // Check path & delete if exists
            Path path = new Path(filePath);
            if (hdfs.exists(path)) {
                hdfs.delete(path, true);
                System.out.println("#-#-# " + filePath + " is deleted.");
            }

            // Write contents as file
            FSDataOutputStream outputStream = hdfs.create(path);
            outputStream.writeUTF(contents);
            outputStream.close();

            //
            FSDataInputStream inputStream = hdfs.open(path);
            String result = inputStream.readUTF();
            inputStream.close();

            System.out.println("#-#-# Saved contents: " + result);

        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}

5.2 실행

Jar 를 Primary Node 로 이동

Primary 노드에 접속해서 경로 생성

mkdir /home/hadoop/example/jars

로컬의 jar 를 scp 로 primary 노드로 이동

$key: primary node EC2 에 접속할 수 있는 key 파일
$project_path: 5.1 을 수행한 java project 의 path
$primary_node: primary node의 public dns 또는 ip 주소

scp -i $key $project_path/build/libs/hadoop-hdfs-app-1.0-SNAPSHOT.jar hadoop@$primary_node:~/example/jars/.

Hadoop 명령어로 jar 실행

hadoop jar $jar_file $main_classname $args

hadoop jar hadoop-hdfs-app-1.0-SNAPSHOT.jar de.example.hadoop.hdfs.InputReadAndFileWriter /data/example/hdfs/input.txt 'Hello world, hello hdfs!'

hdfs 명령어로 input.txt 파일을 확인한다.

hdfs dfs -ls /data/example
hdfs dfs -head /data/example/input.txt

Hyunjun Kim

Data Analytics Engineer 가 되

이전 포스트

[AWS EMR Hadoop 실습] EMR 클러스터 웹 인터페이스

다음 포스트

[AWS EMR Hadoop 실습] HDFS이용하는 Java Application 구현하기

Data_Engineering

5 HDFS이용하는 Java Application 구현하기

5.1 소스코드

settings.gradle

build.gradle

de.example.hadoop.hdfs.InputReadAndFileWriter.java

5.2 실행

Jar 를 Primary Node 로 이동

Hadoop 명령어로 jar 실행

[AWS EMR Hadoop 실습] EMR 클러스터 웹 인터페이스

[Yarn] What is Yarn?

0개의 댓글