节点,常用程序计算配置设置

磁盘大小查询：df -h .

当前文件夹大小：du -sh *

Nova维护后：

建议在 .bashrc 中不要加载特定版本的模块

集群环境更新频繁：HPC 管理员常常会升级/移除旧版本模块，导致你 .bashrc 里的指定版本丢失或与新版本冲突。

不同项目需求不同：有时你想要在不同脚本里加载不同版本的编译器/CUDA/MPI；如果 .bashrc 强制加载某些版本，会干扰别的任务或环境。

模块冲突循环切换：正如你看到的，Lmod 不停地警告“一会儿切回旧版本，一会儿又切回新版本”，说明存在依赖冲突或循环切换。

总结

打开 ~/.bashrc 并注释掉任何显式的 module load 命令（尤其带后缀版本号的模块）。

让 .bashrc 文件只保留必要的通用环境配置（如 PATH、别名alias 等），不要锁定 HPC 模块版本。

在需要使用的场景（交互或作业脚本）中，手动或写脚本来 module purge 并加载合适的版本组合。

这样就可以避免每次读取 .bashrc 时的依赖冲突和版本切换警告。

完成上述操作后，重新登录或执行 source ~/.bashrc，应该就不会再提示这些 Lmod 警告了。

总结

你需要先弄清楚：你的 VASP 是用 Intel 还是 OpenMPI 编译的？

Intel 编译 → module load intel/2024.2.0 一条命令就够了；不要同时加载 GCC/CUDA/OpenMPI。

OpenMPI 编译 → module load gcc + cuda + openmpi。

不要把 Intel 和 GCC + OpenMPI 两套环境同时加载；它们不但“有点冲突”，而且没有这个必要。

只要对应好编译环境与运行环境，你的 srun 就可以正常启动 VASP（全部换成mpirun）。

ldd /work/qan/codes/VASP/bin/vasp_std1 可以查看vasp这个程序是什么编译的以及目前是否库完整

Intel 编译

module purge

module load intel/2024.2.0

OpenMPI 编译

module purge

module load gcc/14.2.0-cuda12-vx6uhdf

module load cuda/12.4.1-cz3ljd3

module load openmpi/4.1.6-cuda12-zp2vjqb

(base) [kluo@condo2017 test1]$ cat ~/.bashrc

# .bashrc

# Source global definitions

if [ -f /etc/bashrc ]; then

. /etc/bashrc

# Uncomment the following line if you don't like systemctl's auto-paging feature:

# export SYSTEMD_PAGER=

# User specific aliases and functions

export AWS_DEFAULT_REGION=us-west-2

export PATH=$HOME/.local/bin:$PATH

#export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/shared/hpc/intel/18.3/mkl/lib/intel64

# >>> conda initialize >>>

# !! Contents within this block are managed by 'conda init' !!

__conda_setup="$('/work/qan-free/kluo/anaconda3/bin/conda' 'shell.bash' 'hook' 2> /dev/null)"

if [ $? -eq 0 ]; then

eval "$__conda_setup"

else

if [ -f "/work/qan-free/kluo/anaconda3/etc/profile.d/conda.sh" ]; then

. "/work/qan-free/kluo/anaconda3/etc/profile.d/conda.sh"

else

export PATH="/work/qan-free/kluo/anaconda3/bin:$PATH"

unset __conda_setup

# <<< conda initialize <<<

(base) [kluo@condo2017 test1]$

lammps 全部用1就行

Nova建议deepmd train设置

export OMP_NUM_THREADS=16

export TF_INTRA_OP_PARALLELISM_THREADS=16

export TF_INTER_OP_PARALLELISM_THREADS=4

Anvil 建议：
export OMP_NUM_THREADS=1
export TF_INTRA_OP_PARALLELISM_THREADS=1
export TF_INTER_OP_PARALLELISM_THREADS=128

hikun的地下仓库

搜索此博客

节点,常用程序计算配置设置

标签

评论

发表评论

此博客中的热门博文

lammps 压痕划痕模拟设置参考

lammps Pdamp,Tdamp的设置经验

dpgen训练经验