Spark General Knowledge

yozzum·2023년 9월 30일
0

Spark

목록 보기
18/21
  1. Resource Manager
    • YARN provides Uptime and Port
    • Uptime : how long is has been running.
    • port : 8088 for incoming client requests and communication with NodeManagers.
  1. HDFS(Hadoop Distributed File system)

    • The output of MAP is written to HDFS in the specified directory configured in mapred.output.dir
  2. SparkSQL query execution phase

    • query parsing > logical optimization > physical planning > code generation > query execution > data serialization/deserialization > result materialization > cleaning up resources
profile
yozzum

0개의 댓글