端到端算法工程师实习生

小米科技 Xiaomi Technology (Beijing, China) Follow 1 day ago

Premium Full-time Systematics Python Kernel EDGE Deep Learning

Apply Now

端到端算法工程师实习生

北京

校招

实习

软件研发类

职位描述

1. Develop quantization, sparsity, pruning, and distillation techniques to enhance the production-level autonomous driving models.2. Optimize, convert, and deploy autonomous driving models (e.g., ONNX models) that operate efficiently on diverse hardware (GPU, CPU, in-house AI ASIC).3. Design and implement custom kernels using C++ and CUDA to accelerate model operations and pre/post-processing pipelines.4. Perform a systematic benchmarking, scaling, and validation of inference performance across various hardware platforms (GPU, CPU, in-house AI ASIC).5. Collaborate with hardware, compiler, and AI infra engineers to achieve efficient and accurate AI model inference.

职位要求

1. Familiar with techniques like quantization, sparsity, pruning, and distillation for edge and real-time inference.2. Strong proficiency in Python and C++, and deep learning frameworks such as PyTorch.3. Hands-on expertise with CUDA programming, low-level performance profiling, and compiler-level optimization.4. Strong understanding of computer systems and architecture, with experience deploying AI models on GPUs and NPUs.5. Experience with model optimization tools like ModelOpt/ONNXSim/TensorRT, as well as deploying models on edge devices or mobile platforms.6. Strong problem-solving skills with the ability to debug and optimize high-performance inference workloads.7. Experience collaborating with hardware/compiler/AI infra engineers to connect model-level and system-level optimization.

投递

Apply Now

Save Job

Sign In
Create Account

Sign in

To continue your application

or continue with email

By continuing you agree to our Terms & Privacy Policy.

Similar jobs

端到端算法工程师实习生

实习-智能辅助驾驶端到端算法工程师实习生（端到端模型 / 强化学习）

实习-智能辅助驾驶端到端算法工程师实习生（端到端模型 / 强化学习）

实习-智能辅助驾驶端到端算法工程师实习生（端到端模型 / 强化学习）