Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update Paddle USERNAME repo #6

Merged
merged 185 commits into from
Nov 15, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
185 commits
Select commit Hold shift + click to select a range
d43f75e
add rois_num for roi_align xpu OP (#28077)
LDOUBLEV Oct 20, 2020
af70924
fix test_group_norm_op_v2.py, test=develop (#28104)
frankwhzhang Oct 20, 2020
5a589b2
reduce imperative ocr attention config; test=develop (#28079)
phlrain Oct 20, 2020
74fadeb
Add Ubuntu18 dockerfile with cuda11 (#28083)
tianshuo78520a Oct 20, 2020
6dd64b0
randperm run error in multi-gpus (#27942)
zhupengyang Oct 20, 2020
135b62a
[Dy2stat] Refine code of DygraphToStaticAst (#28103)
Aurelius84 Oct 20, 2020
74c8a81
Add pil backend for vision transforms (#28035)
LielinJiang Oct 20, 2020
a911c19
fill_constant op supports NaN and Inf (#28109)
zhiqiu Oct 20, 2020
80c5d23
Fix the name error and exit caused by judgment failed, test=document_…
Avin0323 Oct 20, 2020
afe68cb
unset proxy for block file in bos (#28119)
XieYunshen Oct 20, 2020
d1e1f17
fix generate_proposal_labels in cascade-rcnn series model, test=devel…
jerrywgz Oct 20, 2020
d87d286
Add build paddle inference (#28131)
tianshuo78520a Oct 20, 2020
cd37244
disable test_dist_mnist_hallreduce, test=develop (#28129)
Oct 20, 2020
446d184
Add new api: is_tensor (#28111)
Ray2020BD Oct 21, 2020
5d70002
fix dynamic_loader more safe and error message on windows (#28117)
zhwesky2010 Oct 21, 2020
085b961
fix bugs test=develop (#28125)
licanisme Oct 21, 2020
5289b72
fix Wmaybe-uninitialized warning in pooling.cc, test=develop (#28126)
LDOUBLEV Oct 21, 2020
602d2ce
change avg pooling from trt plugin to trt layer (#28032)
cryoco Oct 21, 2020
5cd97a1
support multiclass nms for multi-batch, test=develop (#28154)
jerrywgz Oct 21, 2020
f29fb39
dygraph nccl init support host domain name (#28107)
danleifeng Oct 21, 2020
2d45d9a
add static_mode_white_list (#28112)
pangyoki Oct 21, 2020
bc46069
fix dynamic decode imperative (#28160)
FrostML Oct 21, 2020
68c473e
fix Automatic GPU detection failed on windows (#28148)
zhwesky2010 Oct 21, 2020
7c1aa0d
2.0rc api rename (#28088)
Oct 21, 2020
e8db441
delete matplotlib (#28159)
swtkiwi Oct 21, 2020
4873c20
modify ut cmakefile (#28140)
Oct 21, 2020
5d73bfd
fix test_weight_decay_extend error (#28178)
chenwhql Oct 21, 2020
f935ca8
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
jiweibo Oct 21, 2020
c226b2e
update dockerfile (#27589)
tianshuo78520a Oct 22, 2020
ee4309e
upgrade shellcheck version (#28021)
gongweibao Oct 22, 2020
e450823
Fix nccl op test failed, test=develop (#28172)
wangxicoding Oct 22, 2020
68449d1
Update hapi predict interface (#28180)
LielinJiang Oct 22, 2020
e730516
[Dy2stat] Refine return mechanism in @to_static (#28116)
Aurelius84 Oct 22, 2020
1f3be85
Fix bug of fetch_async_op_handle when fetching the feed variable (#28…
zhiqiu Oct 22, 2020
271ee58
Enhance build detection (#28123)
zhwesky2010 Oct 22, 2020
efe6e28
fix strided_slice_op's GetExpectedKernelType (#28192)
Oct 22, 2020
2db77be
fix wrong data type, test=develop (#28203)
LDOUBLEV Oct 22, 2020
11acbfa
refine auto strategy, test=document_fix (#28211)
wangxicoding Oct 22, 2020
23b0190
add xpu white_list, test=kunlun (#28210)
pangyoki Oct 22, 2020
d835118
Hide log message (#28220)
tianshuo78520a Oct 23, 2020
a1e7fd4
Fix test_parallel_executor_test_while_train Random Failure by Decreas…
zhhsplendid Oct 23, 2020
4ea2330
use FLAGS_use_mkldnn to prevent unnecessary attrs copy (#28146)
lidanqing-intel Oct 23, 2020
2babd6f
Add compile limit for PADDLE_ENFORCE without error message (#28221)
chenwhql Oct 23, 2020
4877bd5
fix CUDA9 error due to BuildCustomizations (#28222)
zhwesky2010 Oct 23, 2020
81244fb
add sharding strategy in fleet(#27900)
mapingshuo Oct 26, 2020
11089ca
Fix xpu notest (#28204)
tianshuo78520a Oct 26, 2020
95ac49c
add new dockerfile for paddle and cinn setup (#28225)
Joejiong Oct 26, 2020
f4f823c
Refine the format of printing tensor 2 (#28216)
zhiqiu Oct 26, 2020
d252219
add git mirror url to speed up clone (#28241)
XiaoguangHu01 Oct 26, 2020
9940871
[cherry pick ] cherry pick 28108 28198 28199 from release2.0rc (#28215)
Oct 26, 2020
fb7f852
fix print tensor place,add cpu/cuda/pin_memory API for Tensor (#28200)
zhwesky2010 Oct 26, 2020
7db747d
oneDNN BatchNorm + Act fusion pass. (#27912)
arogowie-intel Oct 26, 2020
4671d85
fix DataLoader return same format between static & dynamic in single …
heavengate Oct 26, 2020
7a3a05c
[Dy2Stat]Support to save model with nested output (#28224)
Aurelius84 Oct 26, 2020
a5c1820
[Dy2stat]Join break cond with while cond in some pattern (#28171)
Aurelius84 Oct 26, 2020
813b2ad
Enrich the python error types of paddle & polish format (#28124)
chenwhql Oct 27, 2020
fdc06f2
add Fuse bn add act pass (#28196)
zhangting2020 Oct 27, 2020
96ae48b
Change CI dockerfile (#28236)
tianshuo78520a Oct 27, 2020
b1eb28d
[Dy2Stat-log] Call warnings.warn() to display the warning-message onl…
liym27 Oct 27, 2020
cdadc8f
refine temporal_shift_op for performance optimization using gpu kerne…
wangchaochaohu Oct 27, 2020
6905608
refine yolo box Op for performace optimization (#28155)
wangchaochaohu Oct 27, 2020
495a9ce
fix the input error of size Op (#28272)
wangchaochaohu Oct 27, 2020
7fcb32d
fill_constant op supports NINF (#28270)
zhiqiu Oct 27, 2020
41d26a8
update matrix nms op to api 2.0 (#28265)
wangxinxin08 Oct 28, 2020
8f87c7e
fix judge bug of errorlevel on cmd (#28271)
zhwesky2010 Oct 28, 2020
d932b56
set random seed to in test_export_deploy_model (#28274)
LiuChiachi Oct 28, 2020
c11d9b3
[oneDNN ] conv2d fwd&bwd optimization (#27871)
jczaja Oct 28, 2020
0b678d4
- sum (#28233)
jczaja Oct 28, 2020
b63e0cc
fix load check_point bug of LinearWarmup (#28280)
zhwesky2010 Oct 28, 2020
8f83d5d
fix AMP auto_cast and grad_scaler En doc (#28177)
pangyoki Oct 28, 2020
4dc8c44
【Paddle.Fleet】Fix fleetrun heter (#28252)
MrChengmo Oct 28, 2020
2853f0c
Set static shape for shape tensor with constant [part 1] (#28275)
zhiqiu Oct 28, 2020
e1e666a
fix conv mkldnn build error (#28288)
luotao1 Oct 28, 2020
a98c69b
fix dygraph gather api
ZHUI Oct 28, 2020
6cebd71
add + - * / @ [] operator to ComplexVariable (#28217)
MingMingShangTian Oct 28, 2020
8953038
Fix transpose in conv cudnn kernel when addto enabled (#28295)
zhiqiu Oct 28, 2020
0351168
Fix lr scheduler step error in hapi when use static mode (#28297)
LielinJiang Oct 29, 2020
842a4e5
fix analyzer_capi_tester, test=develop (#28289)
Shixiaowei02 Oct 29, 2020
e8f2614
Enhance multiclass_nms op to support LoD for dygraph mode (#28276)
yghstill Oct 29, 2020
571a63e
Add bf16 transpose2, reshape2, concat ops (#28195)
wozna Oct 29, 2020
3ccc0a2
enable test_parallel_executor_fetch_isolated_var (#28219)
wanghuancoder Oct 29, 2020
f763cb8
Modify the shell script according to the specification (#28302)
iducn Oct 29, 2020
e1fb467
move cinn dockerfile to dockerfile folder (#28281)
Joejiong Oct 29, 2020
1c385e2
add op_function_generator for box_coder (#28303)
jerrywgz Oct 29, 2020
8cd1c10
Enable GRU infer model running CAPI (#28313)
lidanqing-intel Oct 29, 2020
26ede6e
Add median api. (#28310)
Ray2020BD Oct 30, 2020
fb1e0c9
Make vision datasets return PIL.Image as default (#28264)
LielinJiang Oct 30, 2020
4086f48
Check and fix tensor and scalar type promotion (#28299)
chenwhql Oct 30, 2020
18c86fb
hide some logs of p2p (#28307)
zhiqiu Oct 30, 2020
d9b5f12
update the version of pybind, test=develop (#28284)
Shixiaowei02 Oct 30, 2020
57e4411
[Dy2stat] Support to modify value of buffer tensor (#28328)
Aurelius84 Nov 2, 2020
b96869b
Fix lr setting of AdamW when lr is an instance of LRScheduler (#28300)
guoshengCS Nov 2, 2020
5262b02
add generate_proposals_v2 op (#28214)
jerrywgz Nov 2, 2020
acc11c2
Retry CUDA Initialization to Fix Random Failure, test=develop (#28323)
zhhsplendid Nov 2, 2020
0f4b624
refine the gpu config for performance optimization (#28291)
wangchaochaohu Nov 2, 2020
9a600df
Add rnn_op (#28197)
guoshengCS Nov 3, 2020
f41104e
fix compile out of memory temporary (#28346)
zhwesky2010 Nov 3, 2020
6115c14
Pool2d cuda kernel supports fp16 (#28316)
zhiqiu Nov 3, 2020
17db031
fix pool bug, test=develop (#28359)
LDOUBLEV Nov 3, 2020
953302d
add musl docker build script (#28027)
chen-zhiyu Nov 3, 2020
09fd2b2
Paddle support compile on sw (#27858)
jiweibo Nov 3, 2020
6f0f45f
copy_to_cpu support uint8 (#28372)
jiweibo Nov 3, 2020
84cc61b
[oneDNN] sum op refactor (#28318)
jczaja Nov 3, 2020
ea85179
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
shangzhizhou Nov 3, 2020
c1c3e21
retry will not be executed when the number of failed ut is greater th…
XieYunshen Nov 4, 2020
21a63f6
enhance the op_version_registry, test=develop (#28347)
Shixiaowei02 Nov 4, 2020
337d383
refine (#28366)
wangchaochaohu Nov 4, 2020
8b2436a
Add broadcast_shape api (#28257)
zhiqiu Nov 4, 2020
71d6220
Skip reader op in mixed_precision decorator (#28353)
zhiqiu Nov 4, 2020
95b1868
update DataLoader doc (#28290)
heavengate Nov 4, 2020
0511469
[Inference] Memory modification for ShrinkMemory. (#28355)
jiweibo Nov 4, 2020
f401907
modified sample code of CTCLoss & ctc_loss by remove disable_static()…
Nov 4, 2020
0d25d55
update the cmake cmd, test=develop (#28344)
Shixiaowei02 Nov 4, 2020
12b9587
Add conv_bias pass version python test (#28278)
lidanqing-intel Nov 4, 2020
44a476c
support cuda pinned place (#28416)
zhiqiu Nov 4, 2020
3c95acc
Remove cv2 dependence of normalize (#28361)
LielinJiang Nov 4, 2020
23439b1
show cpp stack when catch signal (#28415)
chenwhql Nov 4, 2020
a430349
[DOC] activation api doc, test=document_fix (#28405)
qili93 Nov 4, 2020
463075a
add paddle.io.ComposeDataset & paddle.io.ChainDataset (#28311)
heavengate Nov 4, 2020
b89b4e3
add fashion dataset (#28411)
LielinJiang Nov 4, 2020
648b92c
[sw] Update compile error for sw (#28419)
jiweibo Nov 4, 2020
ca41541
[oneDNN]Sum bf16 kernel (#28382)
jczaja Nov 5, 2020
2500dca
[Dy2Stat] Fix bug in convert_call (#28368)
Aurelius84 Nov 5, 2020
c41fd03
check op_version_registry in CI test, test=develop (#28402)
Shixiaowei02 Nov 5, 2020
c42e656
Add retry for dygraph parallel socket bind (#28404)
chenwhql Nov 5, 2020
0155f91
enable softmax unittest (#28362)
zhupengyang Nov 5, 2020
dc6b232
remove unused pyc file (#28449)
luotao1 Nov 5, 2020
f683403
fix sample code (#28446)
Nov 5, 2020
bd8dfe3
[Dy2Stat] Refine code example for 2.0 (#28440)
Aurelius84 Nov 6, 2020
ba0fe0a
revert the modified shell script (#28453)
iducn Nov 6, 2020
ba036b8
remove shellcheck test=develop (#28457)
gongweibao Nov 6, 2020
7821759
Add bfloat16 softmax and gelu (#28394)
wozna Nov 6, 2020
ced5c40
Update memory release interface. (#28456)
jiweibo Nov 6, 2020
6bba8e5
fix batch_norm_xpu bug & remove xpusimulator dependence (#28430)
QingshuChen Nov 6, 2020
fad4744
fix crash in adam in xpu, *test=kunlun (#28433)
taixiurong Nov 6, 2020
7fe5f9c
Fix unittest random failure (#28363)
zhangting2020 Nov 6, 2020
b5e662f
refine jit.save/load to add support for other method, not only forwar…
hbwx24 Nov 6, 2020
155b4f9
Remove selected rows all reduce over height check (#28460)
chenwhql Nov 6, 2020
ba07563
exec ut no more than 15s 1 (#28439)
XieYunshen Nov 8, 2020
72c78e4
exec ut no more than 15s 2 (#28441)
XieYunshen Nov 8, 2020
e29ab5e
clear clcache cache file and reopen clcache (#28384)
wanghuancoder Nov 9, 2020
e14ed71
refine the performance of gather Op (#28458)
wangchaochaohu Nov 9, 2020
18525d3
fix Tanh remainder en doc (#28455)
wangxicoding Nov 9, 2020
4fa1d39
Fix en doc for rnn.py. test=document_fix (#28470)
smallv0221 Nov 9, 2020
7fd2077
set NCCL_SHM_DISABLE=1 for test_parallel_executor_profilery.py (#28484)
luotao1 Nov 9, 2020
faa6549
setting timeout properity for test_flags_mkldnn_ops_on_off (#28489)
XieYunshen Nov 9, 2020
1e698c6
fix cmake error when setting ut timeout properity (#28492)
XieYunshen Nov 9, 2020
d3b2d07
modified timeout value on windows (#28499)
XieYunshen Nov 9, 2020
e1a5fc4
fix ut exec timeout notest,test=kunlun (#28495)
XieYunshen Nov 9, 2020
eb08556
Set Exclusive for test_yolov3 to Avoid Memory Segmentation Fault in C…
zhhsplendid Nov 9, 2020
645e999
fix api_impl test. (#28483)
jiweibo Nov 9, 2020
2159646
modified timeout value for test_conv3d_mkldnn_op,test=document_fix (#…
XieYunshen Nov 9, 2020
241505c
refine the doc for API2.0 (#28386)
wangchaochaohu Nov 10, 2020
369605b
fix cmake error when execute build_inference_lib (#28503)
XieYunshen Nov 10, 2020
53e9aa9
remove diff with develop (#28504)
zhwesky2010 Nov 10, 2020
8f664a5
fix one_hot example code (#28432)
yukavio Nov 10, 2020
546b1c1
Fix paddle.jit.dy2static.data_layer_not_check 2.0 API Doc (#28461)
zhhsplendid Nov 10, 2020
dc45561
Update version docker (#28314)
tianshuo78520a Nov 10, 2020
47cbf61
fix softmax unittest float16 random error (#28480)
zhupengyang Nov 10, 2020
c70c1c5
make Numpy version is below 1.19.3 (#28510)
zhwesky2010 Nov 10, 2020
75196cd
Paddle-TRT int8 support mul op channelwise quant (#28422)
cryoco Nov 10, 2020
3b0d31a
Modify ProgramTranslator and TracedLayer Doc for API 2.0 (#28509)
zhhsplendid Nov 11, 2020
0ce933a
[API2.0] Fix documents of 6 APIs to fit API2.0: (#28514)
liym27 Nov 11, 2020
98dc11b
add monitoring for executive ut at night (#28377)
XieYunshen Nov 11, 2020
d7cfee9
Checkout point add (#28488)
wangchaochaohu Nov 11, 2020
5305b27
deprecated APIs under paddle.dataset. test=develop (#28423)
heavengate Nov 11, 2020
621b31c
modified timeout value for test_resnet_v2 and test_resnet (#28532)
XieYunshen Nov 11, 2020
c52fe48
fix the GetKernelTypeForVar of input for fluid.gather (#28534)
wangchaochaohu Nov 11, 2020
26d292b
bugfix for api (mv, empty, empty_like op) (#28513)
windstamp Nov 11, 2020
1bf4836
[Inference] Add TryShrinkMemory interface. (#28409)
jiweibo Nov 11, 2020
543ff33
Refine the format of printing tensor 3 (support scaler tensor) (#28544)
zhiqiu Nov 11, 2020
b258caf
fix add_n doc (eng) (#28464)
Steffy-zxf Nov 12, 2020
0fc181d
[Fix bug] If the pass name is not found, IsCompatible should return f…
lidanqing-intel Nov 12, 2020
08d2413
add log2 operator (#28319)
Joejiong Nov 12, 2020
8699f38
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
shangzhizhou Nov 12, 2020
614f20f
Update `add` `clip` `MSELoss` and `no_grad` (#28530)
willthefrog Nov 12, 2020
849467b
fix user set CUDA_VISIBLE_DEVICES start/end with quotation marks (#28…
zhwesky2010 Nov 13, 2020
7b1619e
disable test_trt_dynamic_shape_transformer_prune,test=document_fix (#…
XieYunshen Nov 13, 2020
bf6e7cb
updata 2.0 API english doc (#28525)
zhwesky2010 Nov 13, 2020
c545b9b
Add ONNX Exporter (#27831)
Nov 13, 2020
c47bfe9
Reduce input data size for syncbn unit test by 8x (#28571)
willthefrog Nov 13, 2020
5579edf
save dtype of inputs (#28301)
LiuChiachi Nov 13, 2020
a829357
register the op version for some ops
ZHUI Nov 13, 2020
ed9dd7c
add send and recv ops (#28590)
Nov 13, 2020
1adc09b
Make Batch Size Smaller on Mac Because of CI Machine (#28569)
zhhsplendid Nov 13, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
8 changes: 0 additions & 8 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -50,11 +50,3 @@ repos:
language: system
files: \.(c|cc|cxx|cpp|cu|h|hpp|hxx|proto|py|sh)$
exclude: (?!.*third_party)^.*$ | (?!.*book)^.*$
- repo: local
hooks:
- id: shellcheck
name: shellcheck
entry: shellcheck
language: system
files: .sh$
exclude: (paddle_build.sh|fast_install.sh|check_file_diff_approvals.sh)
45 changes: 30 additions & 15 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -98,6 +98,13 @@ else(WIN32)
endif(WIN32)

find_package(Git REQUIRED)

# config GIT_URL with github mirrors to speed up dependent repos clone
option(GIT_URL "Git URL to clone dependent repos" ${GIT_URL})
if(NOT GIT_URL)
set(GIT_URL "https://github.com")
endif()

find_package(Threads REQUIRED)

include(simd)
Expand Down Expand Up @@ -131,6 +138,7 @@ option(WITH_LITE "Compile Paddle Fluid with Lite Engine" OFF)
option(WITH_NCCL "Compile PaddlePaddle with NCCL support" ON)
option(WITH_CRYPTO "Compile PaddlePaddle with crypto support" ON)
option(WITH_ARM "Compile PaddlePaddle with arm support" OFF)
option(WITH_SW "Compile PaddlePaddle with sw support" OFF)
option(WITH_MUSL "Compile with musl libc instead of gblic" OFF)

# PY_VERSION
Expand Down Expand Up @@ -193,10 +201,19 @@ if(WITH_BRPC_RDMA)
endif()
endif()

# lite subgraph compilation depends on CUDNN_ROOT,
# so include(cudnn) needs to be in front of include(third_party/lite)
include(cudnn) # set cudnn libraries, must before configure
include(third_party) # download, build, install third_party
if(WITH_GPU)
include(cuda)
# lite subgraph compilation depends on CUDNN_ROOT,
# so include(cudnn) needs to be in front of include(third_party/lite)
include(cudnn) # set cudnn libraries, must before configure
include(tensorrt)
# there is no official support of nccl, cupti in windows
if(NOT WIN32)
include(cupti)
endif()
endif()

include(third_party) # download, build, install third_party, Contains about 20+ dependencies

if(WITH_DISTRIBUTE)
if(WITH_GRPC)
Expand All @@ -209,18 +226,8 @@ if(WITH_DISTRIBUTE)
endif()
endif()

# there is no official support of nccl, cupti in windows
if(NOT WIN32)
include(cupti)
endif()

include(flags) # set paddle compile flags

if(WITH_GPU)
include(cuda)
include(tensorrt)
endif()

if(WITH_PROFILER)
find_package(Gperftools REQUIRED)
include_directories(${GPERFTOOLS_INCLUDE_DIR})
Expand Down Expand Up @@ -251,10 +258,18 @@ if(WITH_ARM)
set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -fPIC")
set(WITH_XBYAK OFF CACHE STRING "Disable XBYAK when compiling WITH_ARM=ON" FORCE)
set(WITH_MKL OFF CACHE STRING "Disable MKL when compiling WITH_ARM=ON." FORCE)
set(WITH_GPU OFF CACHE STRING "Disable GPU when compiling WITH_ARM=ON." FORCE)
add_definitions(-DPADDLE_WITH_ARM)
endif()

if (WITH_SW)
# mieee flag solves floating-point exceptions under sw and ALPHA architectures
set(CMAKE_C_FLAGS "${CMAKE_C_FLAGS} -fPIC -mieee")
set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -fPIC -mieee")
set(WITH_XBYAK OFF CACHE STRING "Disable XBYAK when compiling WITH_SW=ON" FORCE)
set(WITH_MKL OFF CACHE STRING "Disable MKL when compiling WITH_SW=ON." FORCE)
add_definitions(-DPADDLE_WITH_SW)
endif()

set(PADDLE_PYTHON_BUILD_DIR "${CMAKE_CURRENT_BINARY_DIR}/python/build")

set(CMAKE_CXX_FLAGS_RELWITHDEBINFO "-O3 -g -DNDEBUG")
Expand Down
7 changes: 1 addition & 6 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -156,16 +156,12 @@ RUN pip3 --no-cache-dir install -U wheel py-cpuinfo==5.0.0 && \

RUN pip3 --no-cache-dir install 'pre-commit==1.10.4' 'ipython==5.3.0' && \
pip3 --no-cache-dir install 'ipykernel==4.6.0' 'jupyter==1.0.0' && \
pip3 --no-cache-dir install opencv-python && \
pip3.6 --no-cache-dir install 'pre-commit==1.10.4' 'ipython==5.3.0' && \
pip3.6 --no-cache-dir install 'ipykernel==4.6.0' 'jupyter==1.0.0' && \
pip3.6 --no-cache-dir install opencv-python && \
pip3.7 --no-cache-dir install 'pre-commit==1.10.4' 'ipython==5.3.0' && \
pip3.7 --no-cache-dir install 'ipykernel==4.6.0' 'jupyter==1.0.0' && \
pip3.7 --no-cache-dir install opencv-python && \
pip --no-cache-dir install 'pre-commit==1.10.4' 'ipython==5.3.0' && \
pip --no-cache-dir install 'ipykernel==4.6.0' 'jupyter==1.0.0' && \
pip --no-cache-dir install opencv-python
pip --no-cache-dir install 'ipykernel==4.6.0'

#For docstring checker
RUN pip3 --no-cache-dir install pylint pytest astroid isort
Expand Down Expand Up @@ -207,7 +203,6 @@ RUN wget --no-check-certificate https://pslib.bj.bcebos.com/openmpi-1.4.5.tar.gz
export LD_LIBRARY_PATH=/usr/local/lib/:$LD_LIBRARY_PATH && export PATH=/usr/local/bin:$PATH && cd .. && \
rm -rf openmpi-1.4.5.tar.gz && pip --no-cache-dir install mpi4py && ln -fs /bin/bash /bin/sh && \
apt-get install libprotobuf-dev -y
RUN pip --no-cache-dir install -U netifaces==0.10.9

# Older versions of patchelf limited the size of the files being processed and were fixed in this pr.
# https://github.com/NixOS/patchelf/commit/ba2695a8110abbc8cc6baf0eea819922ee5007fa
Expand Down
6 changes: 5 additions & 1 deletion cmake/cblas.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -101,6 +101,8 @@ if(NOT DEFINED CBLAS_PROVIDER AND WITH_SYSTEM_BLAS)
${REFERENCE_CBLAS_INCLUDE_SEARCH_PATHS})
find_library(REFERENCE_CBLAS_LIBRARY NAMES cblas PATHS
${REFERENCE_CBLAS_LIB_SEARCH_PATHS})
find_library(REFERENCE_BLAS_LIBRARY NAMES blas PATHS
${REFERENCE_CBLAS_LIB_SEARCH_PATHS})

if(REFERENCE_CBLAS_INCLUDE_DIR AND REFERENCE_CBLAS_LIBRARY)
set(CBLAS_PROVIDER REFERENCE_CBLAS)
Expand All @@ -125,7 +127,9 @@ endif()
# linear algebra libraries for cc_library(xxx SRCS xxx.c DEPS cblas)

include_directories(${CBLAS_INC_DIR})
if(NOT ${CBLAS_PROVIDER} STREQUAL MKLML)
if(${CBLAS_PROVIDER} STREQUAL REFERENCE_CBLAS)
target_link_libraries(cblas gfortran ${CBLAS_LIBRARIES} ${REFERENCE_BLAS_LIBRARY})
elseif(NOT ${CBLAS_PROVIDER} STREQUAL MKLML)
target_link_libraries(cblas ${CBLAS_LIBRARIES})
endif()

9 changes: 4 additions & 5 deletions cmake/cuda.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -62,10 +62,6 @@ function(detect_installed_gpus out_variable)
if(NOT CUDA_gpu_detect_output)
message(STATUS "Automatic GPU detection failed. Building for all known architectures.")
set(${out_variable} ${paddle_known_gpu_archs} PARENT_SCOPE)
#Todo: fix Automatic GPU detection failed on windows
if(WIN32)
set(${out_variable} "61 75" PARENT_SCOPE)
endif()
else()
set(${out_variable} ${CUDA_gpu_detect_output} PARENT_SCOPE)
endif()
Expand Down Expand Up @@ -198,7 +194,9 @@ elseif (${CMAKE_CUDA_COMPILER_VERSION} LESS 12.0) # CUDA 11.x
set(CMAKE_CUDA_FLAGS "${CMAKE_CUDA_FLAGS} -D__STRICT_ANSI__")
endif()

add_definitions("-DPADDLE_CUDA_BINVER=\"${CUDA_VERSION_MAJOR}${CUDA_VERSION_MINOR}\"")
add_definitions("-DCUDA_VERSION_MAJOR=\"${CUDA_VERSION_MAJOR}\"")
add_definitions("-DCUDA_VERSION_MINOR=\"${CUDA_VERSION_MINOR}\"")
add_definitions("-DCUDA_TOOLKIT_ROOT_DIR=\"${CUDA_TOOLKIT_ROOT_DIR}\"")

# setting nvcc arch flags
select_nvcc_arch_flags(NVCC_FLAGS_EXTRA)
Expand Down Expand Up @@ -249,3 +247,4 @@ endif()

mark_as_advanced(CUDA_BUILD_CUBIN CUDA_BUILD_EMULATION CUDA_VERBOSE_BUILD)
mark_as_advanced(CUDA_SDK_ROOT_DIR CUDA_SEPARABLE_COMPILATION)

11 changes: 6 additions & 5 deletions cmake/cudnn.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -35,17 +35,18 @@ list(APPEND CUDNN_CHECK_LIBRARY_DIRS
${CUDA_TOOLKIT_ROOT_DIR}/lib/x64
)
set(CUDNN_LIB_NAME "")

if (LINUX)
set(CUDNN_LIB_NAME "libcudnn.so")
set(CUDNN_LIB_NAME "libcudnn.so")
endif(LINUX)

if(WIN32)
# only support cudnn7
set(CUDNN_LIB_NAME "cudnn.lib" "cudnn64_7.dll")
# only support cudnn7
set(CUDNN_LIB_NAME "cudnn.lib" "cudnn64_7.dll")
endif(WIN32)

if(APPLE)
set(CUDNN_LIB_NAME "libcudnn.dylib" "libcudnn.so")
set(CUDNN_LIB_NAME "libcudnn.dylib" "libcudnn.so")
endif(APPLE)

find_library(CUDNN_LIBRARY NAMES ${CUDNN_LIB_NAME} # libcudnn_static.a
Expand Down Expand Up @@ -88,7 +89,7 @@ macro(find_cudnn_version cudnn_header_file)
if(NOT CUDNN_MAJOR_VERSION)
set(CUDNN_VERSION "???")
else()
add_definitions("-DPADDLE_CUDNN_BINVER=\"${CUDNN_MAJOR_VERSION}\"")
add_definitions("-DCUDNN_MAJOR_VERSION=\"${CUDNN_MAJOR_VERSION}\"")
math(EXPR CUDNN_VERSION
"${CUDNN_MAJOR_VERSION} * 1000 +
${CUDNN_MINOR_VERSION} * 100 + ${CUDNN_PATCHLEVEL_VERSION}")
Expand Down
2 changes: 2 additions & 0 deletions cmake/cupti.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@ find_path(CUPTI_INCLUDE_DIR cupti.h
PATHS ${CUPTI_ROOT} ${CUPTI_ROOT}/include
$ENV{CUPTI_ROOT} $ENV{CUPTI_ROOT}/include
${CUDA_TOOLKIT_ROOT_DIR}/extras/CUPTI/include
${CUDA_TOOLKIT_ROOT_DIR}/targets/x86_64-linux/include
NO_DEFAULT_PATH
)

Expand All @@ -27,6 +28,7 @@ list(APPEND CUPTI_CHECK_LIBRARY_DIRS
$ENV{CUPTI_ROOT}/lib64
$ENV{CUPTI_ROOT}/lib
/usr/lib
${CUDA_TOOLKIT_ROOT_DIR}/targets/x86_64-linux/lib64
${CUDA_TOOLKIT_ROOT_DIR}/extras/CUPTI/lib64)
find_library(CUPTI_LIBRARY NAMES libcupti.so libcupti.dylib # libcupti_static.a
PATHS ${CUPTI_CHECK_LIBRARY_DIRS} ${CUPTI_INCLUDE_DIR} ${__libpath_hist}
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/brpc.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -40,7 +40,7 @@ ExternalProject_Add(
extern_brpc
${EXTERNAL_PROJECT_LOG_ARGS}
${SHALLOW_CLONE}
GIT_REPOSITORY "https://github.com/apache/incubator-brpc.git"
GIT_REPOSITORY "${GIT_URL}/apache/incubator-brpc.git"
GIT_TAG "ad00fe940b4f05225b214131959293bbed8744a0" #rdma branch's head now.
PREFIX ${BRPC_SOURCES_DIR}
UPDATE_COMMAND ""
Expand Down
4 changes: 2 additions & 2 deletions cmake/external/cryptopp.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ INCLUDE(ExternalProject)
SET(CRYPTOPP_PREFIX_DIR ${THIRD_PARTY_PATH}/cryptopp)
SET(CRYPTOPP_INSTALL_DIR ${THIRD_PARTY_PATH}/install/cryptopp)
SET(CRYPTOPP_INCLUDE_DIR "${CRYPTOPP_INSTALL_DIR}/include" CACHE PATH "cryptopp include directory." FORCE)
SET(CRYPTOPP_REPOSITORY https://github.com/weidai11/cryptopp.git)
SET(CRYPTOPP_REPOSITORY ${GIT_URL}/weidai11/cryptopp.git)
SET(CRYPTOPP_TAG CRYPTOPP_8_2_0)

IF(WIN32)
Expand Down Expand Up @@ -55,7 +55,7 @@ ExternalProject_Add(
SOURCE_DIR ${CRYPTOPP_SOURCE_DIR}
PATCH_COMMAND
COMMAND ${CMAKE_COMMAND} -E remove_directory "<SOURCE_DIR>/cmake/"
COMMAND git clone https://github.com/noloader/cryptopp-cmake "<SOURCE_DIR>/cmake"
COMMAND git clone ${GIT_URL}/noloader/cryptopp-cmake "<SOURCE_DIR>/cmake"
COMMAND cd "<SOURCE_DIR>/cmake" && git checkout tags/${CRYPTOPP_TAG} -b ${CRYPTOPP_TAG}
COMMAND ${CMAKE_COMMAND} -E copy_directory "<SOURCE_DIR>/cmake/" "<SOURCE_DIR>/"
INSTALL_DIR ${CRYPTOPP_INSTALL_DIR}
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/cub.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ include(ExternalProject)

set(CUB_PREFIX_DIR ${THIRD_PARTY_PATH}/cub)
set(CUB_SOURCE_DIR ${THIRD_PARTY_PATH}/cub/src/extern_cub)
set(CUB_REPOSITORY https://github.com/NVlabs/cub.git)
set(CUB_REPOSITORY ${GIT_URL}/NVlabs/cub.git)
set(CUB_TAG 1.8.0)

cache_third_party(extern_cub
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/dlpack.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ include(ExternalProject)
set(DLPACK_PREFIX_DIR ${THIRD_PARTY_PATH}/dlpack)
set(DLPACK_SOURCE_DIR ${THIRD_PARTY_PATH}/dlpack/src/extern_dlpack)

set(DLPACK_REPOSITORY https://github.com/dmlc/dlpack.git)
set(DLPACK_REPOSITORY ${GIT_URL}/dmlc/dlpack.git)
set(DLPACK_TAG v0.2)

cache_third_party(extern_dlpack
Expand Down
9 changes: 7 additions & 2 deletions cmake/external/eigen.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -22,14 +22,14 @@ set(EIGEN_TAG 4da2c6b1974827b1999bab652a3d4703e1992d26)

# the recent version of eigen will cause compilation error on windows
if(WIN32)
set(EIGEN_REPOSITORY https://github.com/eigenteam/eigen-git-mirror.git)
set(EIGEN_REPOSITORY ${GIT_URL}/eigenteam/eigen-git-mirror.git)
set(EIGEN_TAG 917060c364181f33a735dc023818d5a54f60e54c)
endif()

# eigen on cuda9.1 missing header of math_funtions.hpp
# https://stackoverflow.com/questions/43113508/math-functions-hpp-not-found-when-using-cuda-with-eigen
if(WITH_AMD_GPU)
set(EIGEN_REPOSITORY https://github.com/sabreshao/hipeigen.git)
set(EIGEN_REPOSITORY ${GIT_URL}/sabreshao/hipeigen.git)
set(EIGEN_TAG 7cb2b6e5a4b4a1efe658abb215cd866c6fb2275e)
endif()

Expand Down Expand Up @@ -97,3 +97,8 @@ endif()
add_library(eigen3 INTERFACE)

add_dependencies(eigen3 extern_eigen3)

# sw not support thread_local semantic
if(WITH_SW)
add_definitions(-DEIGEN_AVOID_THREAD_LOCAL)
endif()
2 changes: 1 addition & 1 deletion cmake/external/gflags.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ SET(GFLAGS_PREFIX_DIR ${THIRD_PARTY_PATH}/gflags)
SET(GFLAGS_SOURCE_DIR ${THIRD_PARTY_PATH}/gflags/src/extern_gflags)
SET(GFLAGS_INSTALL_DIR ${THIRD_PARTY_PATH}/install/gflags)
SET(GFLAGS_INCLUDE_DIR "${GFLAGS_INSTALL_DIR}/include" CACHE PATH "gflags include directory." FORCE)
set(GFLAGS_REPOSITORY https://github.com/gflags/gflags.git)
set(GFLAGS_REPOSITORY ${GIT_URL}/gflags/gflags.git)
set(GFLAGS_TAG 77592648e3f3be87d6c7123eb81cbad75f9aef5a)
IF(WIN32)
set(GFLAGS_LIBRARIES "${GFLAGS_INSTALL_DIR}/lib/gflags_static.lib" CACHE FILEPATH "GFLAGS_LIBRARIES" FORCE)
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/glog.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ SET(GLOG_PREFIX_DIR ${THIRD_PARTY_PATH}/glog)
SET(GLOG_SOURCE_DIR ${THIRD_PARTY_PATH}/glog/src/extern_glog)
SET(GLOG_INSTALL_DIR ${THIRD_PARTY_PATH}/install/glog)
SET(GLOG_INCLUDE_DIR "${GLOG_INSTALL_DIR}/include" CACHE PATH "glog include directory." FORCE)
SET(GLOG_REPOSITORY https://github.com/google/glog.git)
SET(GLOG_REPOSITORY ${GIT_URL}/google/glog.git)
SET(GLOG_TAG v0.3.5)

IF(WIN32)
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/gloo.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@ SET(GLOO_INSTALL_DIR ${THIRD_PARTY_PATH}/install/gloo)
SET(GLOO_INCLUDE_DIR "${GLOO_INSTALL_DIR}/include" CACHE PATH "gloo include directory." FORCE)
SET(GLOO_LIBRARY_DIR "${GLOO_INSTALL_DIR}/lib" CACHE PATH "gloo library directory." FORCE)
# As we add extra features for gloo, we use the non-official repo
SET(GLOO_REPOSITORY https://github.com/sandyhouse/gloo.git)
SET(GLOO_REPOSITORY ${GIT_URL}/sandyhouse/gloo.git)
SET(GLOO_TAG v0.0.2)
SET(GLOO_LIBRARIES "${GLOO_INSTALL_DIR}/lib/libgloo.a" CACHE FILEPATH "gloo library." FORCE)

Expand Down
2 changes: 1 addition & 1 deletion cmake/external/gtest.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ SET(GTEST_PREFIX_DIR ${THIRD_PARTY_PATH}/gtest)
SET(GTEST_SOURCE_DIR ${THIRD_PARTY_PATH}/gtest/src/extern_gtest)
SET(GTEST_INSTALL_DIR ${THIRD_PARTY_PATH}/install/gtest)
SET(GTEST_INCLUDE_DIR "${GTEST_INSTALL_DIR}/include" CACHE PATH "gtest include directory." FORCE)
set(GTEST_REPOSITORY https://github.com/google/googletest.git)
set(GTEST_REPOSITORY ${GIT_URL}/google/googletest.git)
set(GTEST_TAG release-1.8.1)

INCLUDE_DIRECTORIES(${GTEST_INCLUDE_DIR})
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/leveldb.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ ExternalProject_Add(
${EXTERNAL_PROJECT_LOG_ARGS}
${SHALLOW_CLONE}
PREFIX ${LEVELDB_SOURCES_DIR}
GIT_REPOSITORY "https://github.com/google/leveldb.git"
GIT_REPOSITORY "${GIT_URL}/google/leveldb.git"
GIT_TAG v1.18
CONFIGURE_COMMAND ""
BUILD_COMMAND CXXFLAGS=-fPIC make -j ${NUM_OF_PROCESSOR} libleveldb.a
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/libxsmm.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ SET(LIBXSMM_LIBS "${LIBXSMM_LIBRARY_DIR}/libxsmm.a"
ExternalProject_Add(
extern_libxsmm
${SHALLOW_CLONE}
GIT_REPOSITORY "https://github.com/hfp/libxsmm.git"
GIT_REPOSITORY "${GIT_URL}/hfp/libxsmm.git"
GIT_TAG "7cc03b5b342fdbc6b6d990b190671c5dbb8489a2"
PREFIX ${LIBXSMM_SOURCES_DIR}
UPDATE_COMMAND ""
Expand Down
6 changes: 3 additions & 3 deletions cmake/external/lite.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ if(XPU_SDK_ROOT)
set(LITE_WITH_XPU ON)
include_directories("${XPU_SDK_ROOT}/XTDK/include")
include_directories("${XPU_SDK_ROOT}/XTCL/include")
add_definitions(-DPADDLE_WITH_XPU)
add_definitions(-DLITE_SUBGRAPH_WITH_XPU)
LINK_DIRECTORIES("${XPU_SDK_ROOT}/XTDK/shlib/")
LINK_DIRECTORIES("${XPU_SDK_ROOT}/XTDK/runtime/shlib/")
endif()
Expand Down Expand Up @@ -62,7 +62,7 @@ if (NOT LITE_SOURCE_DIR OR NOT LITE_BINARY_DIR)
ExternalProject_Add(
${LITE_PROJECT}
${EXTERNAL_PROJECT_LOG_ARGS}
GIT_REPOSITORY "https://github.com/PaddlePaddle/Paddle-Lite.git"
GIT_REPOSITORY "${GIT_URL}/PaddlePaddle/Paddle-Lite.git"
GIT_TAG ${LITE_GIT_TAG}
PREFIX ${LITE_SOURCES_DIR}
PATCH_COMMAND mkdir -p ${LITE_SOURCES_DIR}/src/extern_lite-build/lite/gen_code && touch ${LITE_SOURCES_DIR}/src/extern_lite-build/lite/gen_code/__generated_code__.cc
Expand Down Expand Up @@ -106,7 +106,7 @@ if (NOT LITE_SOURCE_DIR OR NOT LITE_BINARY_DIR)
ExternalProject_Add(
${LITE_PROJECT}
${EXTERNAL_PROJECT_LOG_ARGS}
GIT_REPOSITORY "https://github.com/PaddlePaddle/Paddle-Lite.git"
GIT_REPOSITORY "${GIT_URL}/PaddlePaddle/Paddle-Lite.git"
GIT_TAG ${LITE_GIT_TAG}
PREFIX ${LITE_SOURCES_DIR}
UPDATE_COMMAND ""
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/mkldnn.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@ SET(MKLDNN_PREFIX_DIR ${THIRD_PARTY_PATH}/mkldnn)
SET(MKLDNN_SOURCE_DIR ${THIRD_PARTY_PATH}/mkldnn/src/extern_mkldnn)
SET(MKLDNN_INSTALL_DIR ${THIRD_PARTY_PATH}/install/mkldnn)
SET(MKLDNN_INC_DIR "${MKLDNN_INSTALL_DIR}/include" CACHE PATH "mkldnn include directory." FORCE)
SET(MKLDNN_REPOSITORY https://github.com/oneapi-src/oneDNN.git)
SET(MKLDNN_REPOSITORY ${GIT_URL}/oneapi-src/oneDNN.git)
SET(MKLDNN_TAG 361725600224f41b7347a1c6bee9b04d1e6c14d7)

# Introduce variables:
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/openblas.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ INCLUDE(ExternalProject)
SET(CBLAS_PREFIX_DIR ${THIRD_PARTY_PATH}/openblas)
SET(CBLAS_SOURCE_DIR ${THIRD_PARTY_PATH}/openblas/src/extern_openblas)
SET(CBLAS_INSTALL_DIR ${THIRD_PARTY_PATH}/install/openblas)
SET(CBLAS_REPOSITORY https://github.com/xianyi/OpenBLAS.git)
SET(CBLAS_REPOSITORY ${GIT_URL}/xianyi/OpenBLAS.git)
SET(CBLAS_TAG v0.3.7)
IF(WITH_ARM)
# Under the FT2000 architecture, the calculation result of blas.sgemm in openblas 0.3+ is wrong,
Expand Down
2 changes: 1 addition & 1 deletion cmake/external/protobuf.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -198,7 +198,7 @@ FUNCTION(build_protobuf TARGET_NAME BUILD_FOR_HOST)
"-Dprotobuf_MSVC_STATIC_RUNTIME=${MSVC_STATIC_CRT}")
ENDIF()

SET(PROTOBUF_REPOSITORY https://github.com/protocolbuffers/protobuf.git)
SET(PROTOBUF_REPOSITORY ${GIT_URL}/protocolbuffers/protobuf.git)
SET(PROTOBUF_TAG 9f75c5aa851cd877fb0d93ccc31b8567a6706546)

cache_third_party(${TARGET_NAME}
Expand Down
Loading