問題描述
我真的很想訪問 pandas 0.19 中的一些更新功能,但 Azure ML 工作室使用 pandas 0.18 作為 Anaconda 4.0 捆綁包的一部分.有沒有辦法更新執行 Python 腳本"組件中使用的版本?
我提供以下步驟來展示如何在Execute Python Script
中更新pandas庫的版本.
第一步:使用virtualenv
組件在你的系統中創建一個獨立的python運行環境.請先用命令pip install virtualenv
如果你沒有的話.
如果你安裝成功,你可以在你的 python/Scripts 文件中看到它.
Step2:運行命令創建獨立的python運行環境.
第三步:然后進入創建目錄的Scripts文件夾并激活它(這一步很重要,不要錯過)
請不要關閉此命令窗口并在此命令窗口中使用 pip install pandas==0.19
下載外部庫.
第 4 步:將 Lib/site-packages 文件夾中的所有文件壓縮成一個 zip 包(我這里叫它 pandas - 包)
第 5 步:將壓縮包上傳到 Azure Machine Learning WorkSpace DataSet.
具體步驟請參考
第6步:在Execute Python Script模塊定義方法azureml_main
之前,需要去掉舊的pandas
模塊&它的依賴項,然后再次導入 pandas
,如下面的代碼.
導入系統將熊貓導入為 pd打印(pd.__version__)del sys.modules['pandas']del sys.modules['numpy']del sys.modules['pytz']del sys.modules['六']del sys.modules['dateutil']sys.path.insert(0, '.\Script Bundle')for td in [m for m in sys.modules if m.startswith('pandas.') or m.startswith('numpy.') or m.startswith('pytz.') or m.startswith('dateutil.') 或 m.startswith('six.')]:del sys.modules[td]將熊貓導入為 pd打印(pd.__version__)# 入口點函數最多可以包含兩個輸入參數:# 參數<dataframe1>:一個pandas.DataFrame# 參數<dataframe2>:一個pandas.DataFramedef azureml_main(dataframe1 = 無,dataframe2 = 無):
然后你可以從日志中看到結果如下,首先打印舊版本0.14.0
,然后從上傳的zip文件中打印新版本0.19.0
.
[信息] 0.14.0[信息] 0.19.0
您還可以參考以下主題:訪問在 Azure 中使用時間戳的 blob 文件 和通過重置重新加載.
希望對你有幫助.
I would really like to get access to some of the updated functions in pandas 0.19, but Azure ML studio uses pandas 0.18 as part of the Anaconda 4.0 bundle. Is there a way to update the version that is used within the "Execute Python Script" components?
I offer the below steps for you to show how to update the version of pandas library in Execute Python Script
.
Step 1 : Use the virtualenv
component to create an independent python runtime environment in your system.Please install it first with command pip install virtualenv
if you don't have it.
If you installed it successfully ,you could see it in your python/Scripts file.
Step2 : Run the commad to create independent python runtime environment.
Step 3 : Then go into the created directory's Scripts folder and activate it (this step is important , don't miss it)
Please don't close this command window and use pip install pandas==0.19
to download external libraries in this command window.
Step 4 : Compress all of the files in the Lib/site-packages folder into a zip package (I'm calling it pandas - package here)
Step 5 :Upload the zip package into the Azure Machine Learning WorkSpace DataSet.
specific steps please refer to the Technical Notes.
After success, you will see the uploaded package in the DataSet List
Step 6 : Before the defination of method azureml_main
in the Execute Python Script module, you need to remove the old pandas
modules & its dependencies, then to import pandas
again, as the code below.
import sys
import pandas as pd
print(pd.__version__)
del sys.modules['pandas']
del sys.modules['numpy']
del sys.modules['pytz']
del sys.modules['six']
del sys.modules['dateutil']
sys.path.insert(0, '.\Script Bundle')
for td in [m for m in sys.modules if m.startswith('pandas.') or m.startswith('numpy.') or m.startswith('pytz.') or m.startswith('dateutil.') or m.startswith('six.')]:
del sys.modules[td]
import pandas as pd
print(pd.__version__)
# The entry point function can contain up to two input arguments:
# Param<dataframe1>: a pandas.DataFrame
# Param<dataframe2>: a pandas.DataFrame
def azureml_main(dataframe1 = None, dataframe2 = None):
Then you can see the result from logs as below, first print the old version 0.14.0
, then print the new version 0.19.0
from the uploaded zip file.
[Information] 0.14.0
[Information] 0.19.0
You could also refer to these threads: Access blob file using time stamp in Azure and reload with reset.
Hope it helps you.
這篇關于在 Azure ML Studio 中將 pandas 更新到 0.19 版的文章就介紹到這了,希望我們推薦的答案對大家有所幫助,也希望大家多多支持html5模板網!