You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* .
* .
* Update code_test.py
* .
* Update Dockerfile
* Do not use conda-merge
* Move all 3 conda files to a single dir
* Do not use conda-merge
* Pin package versions
* PR review fixes
* Update Dockerfile
* PR review fixes
* Update training_dependencies.yml
* Update code_test.py
Copy file name to clipboardExpand all lines: docs/code_description.md
+7-4Lines changed: 7 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,9 +2,7 @@
2
2
3
3
### Environment Setup
4
4
5
-
-`environment_setup/requirements.txt` : It consists of a list of python packages which are needed by the train.py to run successfully on host agent (locally).
6
-
7
-
-`environment_setup/install_requirements.sh` : This script prepares the python environment i.e. install the Azure ML SDK and the packages specified in requirements.txt
5
+
-`environment_setup/install_requirements.sh` : This script prepares a local conda environment i.e. install the Azure ML SDK and the packages specified in environment definitions.
8
6
9
7
-`environment_setup/iac-*.yml, arm-templates` : Infrastructure as Code piplines to create and delete required resources along with corresponding arm-templates.
10
8
@@ -27,6 +25,12 @@
27
25
-`ml_service/pipelines/diabetes_regression_verify_train_pipeline.py` : determines whether the evaluate_model.py step of the training pipeline registered a new model.
28
26
-`ml_service/util` : contains common utility functions used to build and publish an ML training pipeline.
29
27
28
+
### Environment Definitions
29
+
30
+
-`diabetes_regression/training_dependencies.yml` : Conda environment definition for the training environment (Docker image in which train.py is run).
31
+
-`diabetes_regression/scoring_dependencies.yml` : Conda environment definition for the scoring environment (Docker image in which score.py is run).
32
+
-`diabetes_regression/ci_dependencies.yml` : Conda environment definition for the CI environment.
33
+
30
34
### Code
31
35
32
36
-`diabetes_regression/training/train.py` : a training step of an ML training pipeline.
@@ -39,5 +43,4 @@
39
43
40
44
### Scoring
41
45
-`diabetes_regression/scoring/score.py` : a scoring script which is about to be packed into a Docker Image along with a model while being deployed to QA/Prod environment.
42
-
-`diabetes_regression/scoring/conda_dependencies.yml` : contains a list of dependencies required by score.py to be installed in a deployable Docker Image
43
46
-`diabetes_regression/scoring/inference_config.yml`, deployment_config_aci.yml, deployment_config_aks.yml : configuration files for the [AML Model Deploy](https://marketplace.visualstudio.com/items?itemName=ms-air-aiagility.private-vss-services-azureml&ssr=false#overview) pipeline task for ACI and AKS deployment targets.
Copy file name to clipboardExpand all lines: docs/getting_started.md
+8-1Lines changed: 8 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -171,7 +171,13 @@ Great, you now have the build pipeline set up which automatically triggers every
171
171
172
172
**Note:** The build pipeline also supports building and publishing ML
173
173
pipelines using R to train a model. This is enabled
174
-
by changing the `build-train-script` pipeline variable to either `diabetes_regression_build_train_pipeline_with_r.py`, or `diabetes_regression_build_train_pipeline_with_r_on_dbricks.py`. For pipeline training a model with R on Databricks you'll need
174
+
by changing the `build-train-script` pipeline variable to either of:
175
+
*`diabetes_regression_build_train_pipeline_with_r.py` to train a model
176
+
with R on Azure ML Compute. You will also need to add the
177
+
`r-essentials` Conda packages into `diabetes_regression/scoring_dependencies.yml`
178
+
and `diabetes_regression/training_dependencies.yml`.
to train a model with R on Databricks. You will need
175
181
to manually create a Databricks cluster and attach it to the ML Workspace as a
176
182
compute (Values DB_CLUSTER_ID and DATABRICKS_COMPUTE_NAME variables should be
177
183
specified).
@@ -243,6 +249,7 @@ Make sure your webapp has the credentials to pull the image from the Azure Conta
243
249
* You should edit the pipeline definition to remove unused stages. For example, if you are deploying to ACI and AKS, you should delete the unused `Deploy_Webapp` stage.
244
250
* The sample pipeline generates a random value for a model hyperparameter (ridge regression [*alpha*](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Ridge.html)) to generate 'interesting' charts when testing the sample. In a real application you should use fixed hyperparameter values. You can [tune hyperparameter values using Azure ML](https://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-tune-hyperparameters), and manage their values in Azure DevOps Variable Groups.
245
251
* You may wish to enable [manual approvals](https://docs.microsoft.com/en-us/azure/devops/pipelines/process/approvals) before the deployment stages.
252
+
* You can install additional Conda or pip packages by modifying the YAML environment configurations under the `diabetes_regression` directory. Make sure to use fixed version numbers for all packages to ensure reproducibility, and use the same versions across environments.
246
253
* You can explore aspects of model observability in the solution, such as:
247
254
***Logging**: navigate to the Application Insights instance linked to the Azure ML Portal,
248
255
then to the Logs (Analytics) pane. The following sample query correlates HTTP requests with custom logs
0 commit comments