Pipeline
- Data Acquisition
- Data Transformation
- Model Training
- Model Inference
- Model Scoring
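The five phases above can be sketched as a minimal Python pipeline. This is purely illustrative: every function name and the toy "model" are assumptions, not the kit's actual API.

```python
# Illustrative sketch of the five benchmark phases; all names are made up here.

def acquire_data():
    # stand-in for the data acquisition / generation phase
    return [1.0, 2.0, 3.0, 4.0]

def transform(raw):
    # data transformation: normalize samples to [0, 1]
    hi = max(raw)
    return [x / hi for x in raw]

def train(features):
    # model training: a trivial "model" that is just the feature mean
    return sum(features) / len(features)

def infer(model, features):
    # model inference: label each sample as above/below the learned mean
    return [x >= model for x in features]

def score(predictions, labels):
    # model scoring: fraction of predictions matching the expected labels
    hits = sum(p == l for p, l in zip(predictions, labels))
    return hits / len(labels)

raw = acquire_data()
features = transform(raw)
model = train(features)
preds = infer(model, features)
accuracy = score(preds, [False, False, True, True])
```

Each phase consumes the previous phase's output, which is the essential shape of the benchmark regardless of engine (single-node python or multi-node spark).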
Project folder structure
- workload: main use case script
  - python: workload scripts for each use case for the single-node system
  - spark: workload scripts for each use case for the multi-node system
  - spark3: workload scripts for each use case for the multi-node system
- data-gen
  - config
    - tpcxai-generation.xml: dataset download specification
    - tpcxai-schema.xml: dataset type schema specification
- tools
  - python: python setup scripts for the single-node system
    - python.yaml: default python conda environment file for the spark engine
    - python-ks.yaml: conda environment file with additional packages ("ks" presumably stands for "kitchen sink")
  - spark: spark setup scripts for the multi-node system
- lib: java jar libraries
- driver: project config & source code (TensorFlow & Keras based)
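Given the layout above, the single-node python environment can be created from the kit's YAML file with conda. A sketch, assuming conda is installed and the `tools/python` paths shown above; the block skips gracefully when either is missing.

```shell
# Sketch: create the single-node conda environment from the kit's YAML file.
# Assumes conda is installed and run from the kit's root directory.
if command -v conda >/dev/null 2>&1 && [ -f tools/python/python.yaml ]; then
    conda env create -f tools/python/python.yaml
    # conda env create -f tools/python/python-ks.yaml  # variant with extra packages
    status="created"
else
    status="skipped (conda or tools/python/python.yaml not available)"
fi
echo "environment setup: $status"
```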
Project scripts
- setenv.sh: set the necessary environment variables and the scale factor
- setup-python.sh: create the virtual python environment for the single-node system
- setup-spark.sh: create the virtual python environment for the multi-node system
- TPCx-AI_Benchmarkrun.sh: run the benchmark
- TPCx-AI_Validation.sh: run the validation
- Full_TPCx-AI_Benchmarkrun.sh: run validation and benchmark
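A typical single-node run would invoke these scripts in order. The sketch below only uses the script names listed above; the ordering, the absence of flags, and the helper function are assumptions, and each step is skipped when the script is not present.

```shell
# Sketch of a full single-node run, from the kit's root directory.
# Run a kit script if it exists; otherwise report the skip.
run_step() {
    if [ -x "./$1" ]; then
        "./$1"
    else
        echo "skipping $1 (not found)"
    fi
}

# setenv.sh is sourced (not executed) so its variables reach the later steps
[ -f ./setenv.sh ] && . ./setenv.sh || echo "skipping setenv.sh (not found)"
run_step setup-python.sh           # one-time environment setup (single-node)
run_step TPCx-AI_Validation.sh     # validate the installation first
run_step TPCx-AI_Benchmarkrun.sh   # then run the measured benchmark
```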