Commit 11b630b

remove manual model registration section and reorder sections
1 parent f65410b commit 11b630b

1 file changed: +15 -46 lines changed

sdk/python/foundation-models/system/reinforcement-learning/reinforcement-learning.ipynb

Lines changed: 15 additions & 46 deletions
@@ -277,45 +277,7 @@
277277
"## <span style=\"font-size:0.8em;\"> </span>\n",
278278
"\n",
279279
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
280-
" <h3 style=\"margin: 0; text-align: center;\">📋Section 4. Creating Data Assets from Pipeline Output</h3>\n",
281-
"</div>"
282-
]
283-
},
284-
{
285-
"cell_type": "markdown",
286-
"metadata": {},
287-
"source": [
288-
"<p>Follow these steps to convert your pipeline outputs into re-usable data assets:</p>\n",
289-
"\n",
290-
"<ol style=\"color: #2c3e50; line-height: 1.8;\">\n",
291-
"<li><strong style=\"color: #9b59b6;\">Access Pipeline Overview:</strong> Double-click on the completed pipeline component to view the pipeline overview.</li>\n",
292-
"\n",
293-
"<li><strong style=\"color: #9b59b6;\">Locate Output Asset:</strong> In the outputs section, find the data asset named <code style=\"background: #f8f9fa; padding: 2px 6px; border-radius: 3px;\">model_output</code></li>\n",
294-
"\n",
295-
"<li><strong style=\"color: #9b59b6;\">Browse Datastore:</strong> Click <span style=\"background: #2e53cc63; color: white; padding: 2px 8px; border-radius: 3px;\">View in datastores browse</span> in the bottom left</li>\n",
296-
"\n",
297-
"<li><strong style=\"color: #9b59b6;\">Navigate Structure:</strong> Explore the tree structure of all folders in the output directory</li>\n",
298-
"\n",
299-
"<li><strong style=\"color: #9b59b6;\">Create Asset:</strong> Navigate to the <code style=\"background: #f8f9fa; padding: 2px 6px; border-radius: 3px;\">model_output</code> folder, open the context menu, and select <span style=\"background: #2e53cc63; color: white; padding: 2px 8px; border-radius: 3px;\">Create a data asset</span></li>\n",
300-
"\n",
301-
"<li><strong style=\"color: #9b59b6;\">Configure Asset:</strong> Enter a name for the output asset in the modal window</li>\n",
302-
"\n",
303-
"<li><strong style=\"color: #9b59b6;\">Complete Creation:</strong> Click the <span style=\"background: #2e53cc63; color: white; padding: 2px 8px; border-radius: 3px;\">Create</span> button</li>\n",
304-
"\n",
305-
"<li><strong style=\"color: #9b59b6;\">Verify Access:</strong> The new data asset will now be visible in the data section of Azure ML Studio</li>\n",
306-
"</ol>\n",
307-
"\n",
308-
"<p><em style=\"color: #7f8c8d;\">💡 These data assets can be reused across multiple workflows and experiments.</em></p>\n"
309-
]
310-
},
311-
{
312-
"cell_type": "markdown",
313-
"metadata": {},
314-
"source": [
315-
"## <span style=\"font-size:0.8em;\"> </span>\n",
316-
"\n",
317-
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
318-
" <h3 style=\"margin: 0; text-align: center;\">📊Section 5. Compare Model Performance across Base Model vs GRPO vs Reinforce++ </h3>\n",
280+
" <h3 style=\"margin: 0; text-align: center;\">📊Section 4. Compare Model Performance across Base Model vs GRPO vs Reinforce++ </h3>\n",
319281
"</div>"
320282
]
321283
},
@@ -342,6 +304,13 @@
342304
"<p><em>💡 The evaluation will help identify which RL method produces the most effective model for financial reasoning tasks.</em></p>"
343305
]
344306
},
307+
{
308+
"cell_type": "markdown",
309+
"metadata": {},
310+
"source": [
311+
"<p>We will now submit the evaluation job with the grpo and rlpp model outputs.</p>"
312+
]
313+
},
345314
{
346315
"cell_type": "code",
347316
"execution_count": null,
@@ -390,7 +359,7 @@
390359
"cell_type": "markdown",
391360
"metadata": {},
392361
"source": [
393-
"<p>Fetch metrics from the evaluation run.</p>"
362+
"<p>Now, let's fetch metrics from the evaluation run in order to show the comparison.</p>"
394363
]
395364
},
396365
{
@@ -484,7 +453,7 @@
484453
"## <span style=\"font-size:0.8em;\"> </span>\n",
485454
"\n",
486455
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
487-
" <h3 style=\"margin: 0; text-align: center;\">🧩Section 6. Create Draft Model for Speculative Decoding</h3>\n",
456+
" <h3 style=\"margin: 0; text-align: center;\">🧩Section 5. Create Draft Model for Speculative Decoding</h3>\n",
488457
"</div>"
489458
]
490459
},
@@ -533,7 +502,7 @@
533502
"## <span style=\"font-size:0.8em;\"> </span>\n",
534503
"\n",
535504
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
536-
" <h3 style=\"margin: 0; text-align: center;\">🔄Section 7. Prepare Combined Model for Deployment</h3>\n",
505+
" <h3 style=\"margin: 0; text-align: center;\">🔄Section 6. Prepare Combined Model for Deployment</h3>\n",
537506
"</div>\n",
538507
"\n"
539508
]
@@ -587,7 +556,7 @@
587556
"## <span style=\"font-size:0.8em;\"> </span>\n",
588557
"\n",
589558
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
590-
" <h3 style=\"margin: 0; text-align: center;\">🚀Section 8. Deploy Speculative Decoding Endpoint</h3>\n",
559+
" <h3 style=\"margin: 0; text-align: center;\">🚀Section 7. Deploy Speculative Decoding Endpoint</h3>\n",
591560
"</div>\n",
592561
"\n"
593562
]
@@ -627,7 +596,7 @@
627596
"## <span style=\"font-size:0.8em;\"> </span>\n",
628597
"\n",
629598
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
630-
" <h3 style=\"margin: 0; text-align: center;\">🚀Section 9. Deploy Base Model Endpoint for Comparison</h3>\n",
599+
" <h3 style=\"margin: 0; text-align: center;\">🚀Section 8. Deploy Base Model Endpoint for Comparison</h3>\n",
631600
"</div>\n",
632601
"\n"
633602
]
@@ -669,7 +638,7 @@
669638
"## <span style=\"font-size:0.8em;\"> </span>\n",
670639
"\n",
671640
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
672-
" <h3 style=\"margin: 0; text-align: center;\">🧪Section 10. Test Deployment</h3>\n",
641+
" <h3 style=\"margin: 0; text-align: center;\">🧪Section 9. Test Deployment</h3>\n",
673642
"</div>"
674643
]
675644
},
@@ -704,7 +673,7 @@
704673
"## <span style=\"font-size:0.8em;\"> </span>\n",
705674
"\n",
706675
"<div style=\"background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); color: white; padding: 20px; border-radius: 10px; margin: 20px 0;\">\n",
707-
" <h3 style=\"margin: 0; text-align: center;\">📊Section 11. Performance Evaluation Pipeline</h3>\n",
676+
" <h3 style=\"margin: 0; text-align: center;\">📊Section 10. Performance Evaluation Pipeline</h3>\n",
708677
"</div>"
709678
]
710679
},
