List: bigdata engineering (7)
지구정복
CHAPTER 4 Optimizing the Performance of Iceberg Tables. Compaction. When you are querying your Apache Iceberg tables, you need to open and scan each file and then close the file when you’re done. The more files you have to scan for a query, the greater the cost these file operations will put on your query. It is possible to run into the “small files problem,” where too many small files have an impact o..

A data warehouse acts as a centralized repository for organizations to store all their data coming in from a multitude of sources, allowing data consumers such as analysts and BI engineers to access data easily and quickly from one single source to start their analysis. The Data Lake. While data warehouses provided a mechanism for running analytics on structured data, they still had several issues:..

OpenLDAP is currently integrated with Hue and Hive. Whenever I opened the Hive editor in Hue, the following error kept occurring. Bad status: 3 (b'Error validating the login') (code THRIFTTRANSPORT): TTransportException("Bad status: 3 (b'Error validating the login')") Checking the Hue log also showed the following. [27/Feb/2025 16:50:17 +0900] base DEBUG Selected interpreter hive interface=hiveserver2 compute=None [27/Feb/2025 16:50:17 +0900] dbms DEBU..