TRAINS - AI的自动神奇实验管理器和版本控制
TRAINS - Auto-Magical Experiment Manager & Version Control for AIFeatures
- Add
Task.delete()
support - Add
Task.debug_simulate_remote_task()
to simulate task execution by ClearML Agent - Add warning on archived Task in pipeline #274
- Add
Task.init(..., output_uri=True)
will use the default files_server as output uri - Make
clearml-data
CLI stateful, remember last dataset ID as default dataset - Added
HyperParameterOptimizer.get_optimizer_top_experiments()
for querying post execution optimization pipeline - Add
Task.set_archived()
andTask.get_archived()
- Add
Task.set_credentials()
option to store into credentials file - Add
clearml-data
close now auto uploads - Add
HyperParameterOptimizer
argumentsspawn_task_project
andsave_top_k_tasks_only
Bug Fixes
- Fix
PipelineController
running remotely without configuration does not execute the default code pipeline #273 - Fix reusing task after its project was deleted #274
- Fix
Task.archived_tag
read-only property does not work #274 - Fix argparse support to store consistent str representation of custom objects. Avoid changing default value if remote value matches
- Fix argsparse type as function
- Fix Dataset add single and multiple file(s)
- Fix get project name from parent dataset if not specified
- Fix mpl exporter. Added support for legend
- Fix model upload
- Fix optimizer callback best experiment
- Fix Optuna optimizer failing on tasks with
None
value in scalar query - Fix auto python package detection installed directly from URLs
- Fix dataset upload aborted on server watchdog
- Fix dataset genealogy, graph and restoring data
- Fix numpy dependency for python versions
Assets
3
allegroai-git released this
Bug Fixes
- Fix broken
clearml-task
CLI
Assets
3
allegroai-git released this
Bug Fixes
clearml-task
CLI link fixed
Assets
3
Features
- Add Hydra support #219
- Add cifar ignite example #237
- Add auto extraction of
tar.gz
files when usingStorageManager
#237 - Add
Task.init()
argumentauto_connect_streams
controlling stdout/stderr/logging capture #181 - Add carriage return flush support using the
sdk.development.worker.console_cr_flush_period
configuration setting #181 - Add
Task.create_function_task()
to allow creating a new task using a function and arguments to be executed remotely #230 - Allow disabling SSL certificates verification using
Task.setup_upload()
argumentverify
or AWS S3 bucket configurationverify
property #256 - Add
StorageManager.get_files_server()
- Add
Task.get_project_id()
using project name - Add
project_name
argument toTask.set_project()
- Add
Task.connect()
support for class / instance objects - Add
Task get_configuration_object() and
Task.set_configuration_object()` for easier automation - Improve Auto-Scaler - allow extra configurations, key name and security group are now optional, defaults using empty strings
- Use a built-in matplotlib convertor
- Add reporting text as debug sample example
Bug Fixes
- Fix Optuna HPO parameter serializing #254
- Fix connect dictionary
''
cast toNone
#258 - Fix lightgbm binding keyword argument issue #251
- Fix artifact preview if artifact body is remote URI #239
- Fix infinite recursion in
StorageManager
upload #253 - Fix keras reusing model object only if the filename is the same #252
- Fix running remotely with no configuration should not crash but output a warning #243
- matplotlib
- Fix matplotlib 3.3.3 support
- Fix global figure enumeration
- Fix binding without a title reported a single plot (
untitled 00
) instead of increasing the counter
- Fix Python 2.7/3.5 support
- Fix quote issue when reporting debug images
- Fix replace quote safe characters in upload file to include
;=@$
- Fix
at_exit
called from another process should be ignored - Fix
Task.set_tags()
for completed / published tasks - Fix
Task.add_tags()
not working when running remotely - Fix
Task.set_user_properties()
docstring and interface - Fix preview with JSON (dict) artifacts did not store the artifact
- Fix
Logger.report_text()
on task created usingTask.create()
was not supported - Fix initialization for torch: only call torch
get_worker_info
if torch was loaded - Fix flush (wait) on auxiliary task (obtained using
Task.get_task()
) should wait on all upload events - Fix server was not updated with the defaults from the code when running remotely and configuration section is missing
- Fix connect dict containing
None
default values, blocked the remote execution from passing string instead of None - Fix
Task.upload_artifact()
argumentdelete_after_upload=True
used in conjunction withwait_for_upload=True
was not supported
Assets
3
allegroai-git released this
Features
- Add LightGBM support
- Add initial Hydra support #219
- Add synchronous support for
Task.upload_artifact()
#231 - Add
sdk.development.store_code_diff_from_remote
(defaultfalse
) to store diff from remote HEAD instead of local HEAD #222 - Add
sdk.development.detect_with_conda_freeze
(defaulttrue
) for full conda freeze (requires trains-agent >= 16.2) - Add user properties support using
Task.get_user_properties
,Task.set_user_properties()
andTask.delete_user_properties()
- Add
Logger.report_table()
support for table as list of lists - Add support to split DAG and Table in pipeline DAG plot. Pipeline DAG single nodes are now round circles below the DAG graph.
- Add Pipeline/Optimization can be attached to any Task (not just the current task)
- Add
force_download
flag toStorageManager.get_local_copy()
- Add control over the artifact preview using
Task.upload_artifact()
preview
argument - Add
Logger.report_matplotlib_figure()
with examples - Add
Task.set_task_type()
- AWS auto-scaler
- Add key pair and security groups support
- Add multi-line support for both extra bash script and extra
trains.conf
data
- Update examples
Bug Fixes
- Fix
Task.update_output_model()
wrong argument order #220 - Fix initializing task on argparse parse in remote mode. Do not call
Task.init()
to avoid auto connect, useTask.get_task()
instead - Fix detected task cwd outside of repository root folder
- Fix
Task.connect(dict)
to place non-existing entries on the section name instead of General - Fix
Task.clone()
support for trains-server < 0.16 - Fix
StorageManager
cache extract zipped artifacts. Use modified time instead of access time for cached files - Fix diff command output was stripped
- Make sure local packages with multi-files are marked as
package
- Fix
Task.set_base_docker()
should be skipped when running remotely - Fix ArgParser binding handling of string argument with boolean default value (affects Pytorch Lightning integration)
- When using
detect_with_pip_freeze
make sure thatpackage @ file://
lines are replaced withpackage==x.y.z
as local file will probably not be available - Fix git packages to new pip standard
package @ git+
- Improve conda package naming
_
and-
support - Do not add specific setuptools version to requirements (pip can't install it anyway)
- Fix image URL quoting when uploading from a file path
Assets
3
Features
- Add
Task.set_resource_monitor_iteration_timeout()
to set ResourceMonitor iteration wait duration timeout #208 - Add PyTorch Lightning save/restore model binding #212
- Add
git diff
for repository submodule (requires git 2.14 or above) - Add
TrainsJob.is_completed()
andTrainsJob.is_aborted()
- Add
Task.logger
property - Add Pipeline Controller automation and example (see here)
- Add improved trace filtering capabilities in
trains.debugging.trace.trace_trains()
- Add default help per argument (if not provided) in ArgParser binding
- Deprecate
Task.reporter
- Update PyTorch example
- Remove warning on skipped auto-magic model logging #206
- Support Keras restructuring for Network, Model and Sequential
- Update autokeras requirements according to https://github.com/keras-team/autokeras#installation
Bug Fixes
- Fix joblib auto logging models failing on compressed streams #203
- Fix sending empty reports #205
- Fix scatter2d sub-sampling and rounding
- Fix plots reporting
NaN
representation (matplotlib conversion)- Limit the number of digits in a plot to reduce plot size (using
sdk.metrics.plot_max_num_digits
configuration value)
- Fix
Task.wait_for_status()
to reload after it ends - Fix thread wait Ctrl-C interrupt did not exit process
- Improve Windows support for installed packages analysis
- Fix auto model logging using relative path
- Fix Hyper-parameter Optimization example
- Fix
Task.clone()
when working with TrainsServer < 0.16.0 - Fix pandas artifact handling
- Avoid adding
unnamed:0
column - Return original pandas object
- Avoid adding
- Fix
TrainsJob
hyper-params overriding order was not guaranteed - Fix ArgParse auto-connect to support default function type
Assets
3
Features and Bug Fixes
- Fix
typing
dependency for Python<3.5 #184 - Fix
git+https
requirements handling, resolvetop_level.txt
package name (kerastuner
from git was not detected) - Fix
Task.get_reported_console_output()
for new Trains Server API v2.9 - Fix cache handling for different partitions/drives/devices
- Disable offline mode when running remotely (i.e. executed by Trains Agent)
- Fix artifact upload to only use file stream when not uploading a locally stored file (multipart upload is not supported on stream upload) #189
- Fix double-escaped model design text when connecting OutputModel
- Enhance HyperParameter optimizer
Assets
3
allegroai-git released this
Features
- Add continuing of previously executed experiments. Add
Task.init()
argumentcontinue_last_task
to continue a previously used Task #160 - Allow Task editing/creation from code.
Task.export_task/import_task/update_task()
#128 - Add offline mode. Use
Task.set_offline()
andTask.import_offline_session()
- Support setting offline mode via
TRAINS_OFFLINE_MODE=1
environment variable - Support setting offline API version via
TRAINS_OFFLINE_MODE=2.9
environment variable
- Support setting offline mode via
- Automatically pickle all objects when uploading as artifacts,
task.upload_artifact()
argumentauto_pickle=True
#153 - Add multiple sections/groups support for Task hyper-parameters using
Task.connect()
- Add multiple configurations (files) using
Task.connect_configuration
- Allow enabling OS environment logging using the
sdk.development.log_os_environments
configuration parameter (complements theTRAINS_LOG_ENVIRONMENT
environment variable) - Add Optuna support for hyper-parameter optimization controller.
OptimizerOptuna
is now the default optimizer. - Add initial Keras-Tuner support (keras-team/keras-tuner#334)
- Add automatic FastAI logging. It is disabled if tensorboard is loaded (assuming TensorBoradLogger will be used)
- Support Tensorboard text logging (
add_text()
) as debug samples (.txt
files), instead of as console output - Allow for more standard confusion matrix reporting.
Logger.report_confusion_matrix()
argumentyaxis_reversed
(flips the confusion matrix ifTrue
, defaultFalse
) #165 - Add support for Trains Server 0.16.0 (API v2.9 support)
- Allow disabling Trains update message from the log using the
TRAINS_SUPPRESS_UPDATE_MESSAGE
environment variable #157 - Add AWS EC2 Auto-Scaler service wizard and Service
- Improved and updated examples
- Add Keras Tuner CIFAR10 example
- Add FastAI example
- Update PyTorch Jupyter notebook examples #150
- Support global requirements detection using
pip freeze
(setsdk.development.detect_with_pip_freeze
configuration intrains.conf
) - Add
Task.get_projects()
to get all projects in the system, sorted by last update time
Bug Fixes
- Fix UTC to time stamp in comment #152
- Fix and enhance GPU monitoring
- Fix filename too long bug allegroai/trains-server#49
- Fix TensorFlow image logging to allow images with no width/height/color metadata #182
- Fix multiprocessing Pool throw exception in pool hangs execution. Call original signal handler and re-flush
stdout
- Fix
plotly
support formatplotlib
3.3 - Add Python 2.7 support for
get_current_thread_id()
- Update examples requirements
- Fix and improve signal handling
- Fix Tensorboard 2D convolution histogram, improve histogram accuracy on very small histograms
- Fix auto logging multiple argparse calls before
Task.init()
- Limit experiment Git diff logging to 500Kb. If larger than 500Kb, diff section will contain a warning and entire diff will be uploaded as an artifact named
auxiliary_git_dif
- Fix requirements detection
- Fix Trains installed from
git+
- Fix when Trains is not directly imported
- Fix multiple
-e
packages were not detected (only the first one) - Fix running with Trains in
PYTHONPATH
resulted in double entry of trains
- Fix Trains installed from
- Fix
Task.set_base_docker()
on main task to do nothing when running remotely
Assets
3
allegroai-git released this
Features
- Add direct plotly figure reporting #136
- Improve matplotlib integration #140
- Add
Task.set_parent()
#144 - Add
Task.wait_for_status()
#145 - Enhance framework
save
/load
binding callbacks pytorch/ignite#1056 - Add
TrainsJob.is_failed()
- Add APIClient support using
trains.backend_api.session.APIClient
- Add URL tracking when using
StorageManager.get_local_copy()
(log remote url instead of local file copy) - Improve & expand examples
Bug Fixes
- Fix
pytorch_tensorboard.py
example #148 - Fix
ssh://user@domain@server.com/
in Git repository link #149 - Fix do not call
make_deterministic()
on import - Fix initialization wizard (allow at most two verification retries, then print error) trains-agent #16
- Fix Google CoLab code/package detection
- Improve Pytorch-Ignite integration
- Fix TensorFlow >=2 histogram binding
- Fix
Logger.tensorboard_single_series_per_graph
- Fix anonymous named models
- Fix incorrect entry point detection when called from Trains wrapper (e.g.
TrainsLogger
in Pytorch Ignite/Lightning)
Assets
3
allegroai-git released this
Features
- Add automation support including hyper-parameters optimization (see example here)
Task.init()
auto_connect_arg_parser
argument can accept a dictionary disabling specific keys from the argparser (Trains Slack channel thread)- Allow
worker_id
override usingTRAINS_WORKER_NAME
environment variable (Trains Slack channel thread) - Support layout configuration for plotly objects using
extra_layout
argument in allLogger
reporting methods #136 - Add
Task.execute_remotely()
to allow cloning and enqueuing a locally executed task (or stopping and re-enqueuing a remotely executed task) #128 - Add Parquet framework and model type
- Support recursive model folder packaging
- Add
Task.get_reported_console_output()
andTask.get_reported_scalars()
to allow retrieval of reported reported output and scalar metrics - Add
Task.add_requirements()
to force requirement package into "installed packages" - Improve task reuse responsiveness
- Add
raise_on_error
(defaultFalse
) argument toModel.get_local_copy()
andArtifact.get_local_copy()
allegroai/trains-agent#17 - Support
Task.get_task()
without project name (i.e. all projects) - Support using the file calling
Task.init()
as the task's script in casesys.argv
doesn't point to a git repository - Support detecting and remotely executing code running from a module (i.e.
-m module
) - Add callback for framework
save
/load
binding for better integration with pytorch/ignite pytorch/ignite#1056 - Support new task types provided in Trains Server v0.15.0
- Add automation and distributed examples
- Upgrade default pip version to
<20.2
Bug Fixes
- Fix
exact_match_regex()
in case of empty pattern #138 - Address deprecation warning and newer
attrs
versions inMetricsEventAdapter
#134 - Fix issues with plotly support (Trains Slack channel thread and thread)
- Fix default argument behavior to match argparse behavior
- Fix
OutputModel
withtask=None
should use current task, if exists - Fix
Task.get_task()
to raise proper error on incorrecttask_id
- Fix
Task.enqueue()
to use an exact queue name match - Fix
NaN
,Inf
and-Inf
values display in reported table (not supported by JSON) - Limit max requirement size to 0.5MB
- Fix issues with repository analysis
- Fix
StorageManager
should only try to extract .zip files,Model
should not auto extract package allegroai/trains-agent#17
Assets
3
allegroai-git released this
Features and Bug Fixes
- Add Logger histogram mode (stack/group/relative)
- Add PEP610 support for new pip>=20.1 git reference feature (see pypa/pip#609) #62
- Better support for PyTorch Ignite integration
- Support
#
and?
in uploaded image file names - Support
ArgumentParser.parse_arg()
used before initializing task whenauto_connect_arg_parser=False
- Fix upload_object_via_stream() in Azure storage driver
- Fix
Task.get_tasks()
return type
Assets
3
Features
- Add media (audio) support for both Logger and Tensorboard binding using
Logger.report_media()
#120 - Add autokeras example (see here) #125
- Add caching of extracted zip artifacts
- Add
Task.get_tasks()
filtering support - Add
trains.storage.StorageManager
- Add nicer stdout log flush
- Add text logging to TF V1 example
- Improve SSL retries and error handling
- Improve shutdown behavior in case of a user abort (Ctrl-C)
- Improve repository and module detection
- Improve resource monitor
- Report memory usage for process (and sub-processes)
- Report GPU memory usage for process (and sub-processes)
- Stability
- Improve Jupyter support
- Make sure
trains
is included in Jupyter requirements - Ignore IPython directives in converted Python script (like
%
and!
lines)
- Make sure
- Update Pytorch/TensorboardX examples
Bug Fixes
- Fix renaming/deleting model file right after saving will break asynchronous upload #123
- Fix shutdown behavior:
- Logger flush delay and termination
- Repository/package detection termination
- Threads termination
- Fix text encoding
utf-8
andpr_curve
broken in Tensorboard support - Fix single log request exceeds max packet size
- Fix upload debug assets with series / title including
,
or/
did not add iteration index to the filename - Fix
scikit-learn
module detection - Use UID in message if user name cannot be retrieved (from the Trains Slack channel)
- Fix cast in Tensorflow v2 example
Assets
3
Features and Bug Fixes
- Add OpenMPI/Slurm support, support
TRAINS_FORCE_MASTER_NODE
environment variable to allow forcefully specifying a node is master (not based on the MPI Rank) - Add
Task.get_models()
andTask.models
for retrieving stored models for previously executed tasks - Support multiple
EventWriter
in TensorFlow eager mode (TF 2.0+) #26 - Support reusing models
- Fix TensorFlow
NaN
/Inf
values support #118 - Fix
joblib
binding - Fix metrics/files/models upload on shutdown
- Fix support for
Task
init/close multiple times - Update manual_reporting.py example with
Logger.report_table()
example - Deprecate
Task.set_model_config()
,Task.get_model_config_text()
andTask.get_model_config_dict()
Assets
3
Features and Bug Fixes
- Add support for reporting tables (
pandas
,csv
etc.) usingLogger.report_table()
- Support setting task initial iteration for continuing previous runs using
Task.set_initial_iteration()
andTask.get_initial_iteration()
- Add multiprocess task support #96
- Add
trains-init
support for config file override using theTRAINS_CONFIG_FILE
environment variable or--file
command line argument #105 - Add initial
slurm
support - Allow disabling repository detection when calling
Task.init()
- Add Jupyter notebook server execution example
- Update examples
- Better warning messages for storage errors and missing storage packages
- Fix HTTP link quoting in stored links
Assets
3
Features and Bug Fixes
- Add binding for
tensorboard.summarywriter.addscalars
- Add
Logger.tensorboard_single_series_per_graph()
configuration option to allow per-series TensorBoard-like graphs - Add docker image interface using
Task.set_base_docker()
andTask.get_base_docker()
- Use standard OS environment variables to obtain default credentials for:
- AWS:
AWS_ACCESS_KEY_ID
,AWS_SECRET_ACCESS_KEY
andAWS_DEFAULT_REGION
- Azure:
AZURE_STORAGE_ACCOUNT
andAZURE_STORAGE_KEY
- Google:
GOOGLE_APPLICATION_CREDENTIALS
- AWS:
- Add simple dictionary task parameters interface using
Task.get_parameters_as_dict()
andTask.set_parameters_as_dict()
allowing getting and setting parameters for task instances obtained usingTask.get_task()
- Make sure
Task.connect()
always returns the connected instance passed to it - Prefer
tensorflow_gpu
overtensorflow
when inspecting installed packages and recording experiment dependencies - Remove title/series naming restrictions (allow
$
and.
) when reporting metrics - Fix incorrect printouts in initialization wizard and upgrade notifications
- Fix debug images URL for uploaded files with
%
in their name
Assets
3
Features and Bug Fixes
- Allow reporting a pre-uploaded image url in
Logger.report_image()
using theurl
parameter - Add support for git repositories without ".git" suffix (e.g. Azure Repos)
- Improve conda support
- Improve hyper-parameters argparser integration
- Fix
savefig()
patching inmatplotlib
binding - Fix logs, events and jupyter flushing on exit
Assets
3
Features and Bug Fixes
- Add support for
pyplot.savefig
andpylab.savefig
in matplotlib binding - Add support for SageMaker
- Improved configuration wizard
- Try to make sure tensorboard is available when using torch
- Do not store keras model network design if it cannot be serialized #72
- Fix matplotlib binding support
Assets
3
allegroai-git released this
Features and Bug Fixes
- Add support for trains-server v0.13.0
- Add support for nested (non-main) tasks
- Add warning when automatic argument parser binding cannot be turned off
- Add
Task.upload_artifact()
support for external URLs (pre-uploaded) - Add support for special characters in hyper-parameter keys (white-spaces,
.
and$
) #69 - Add support for PyTorch
.pt
model files - Calculate data-audit artifact uniqueness by user-criteria #45
- Use an environment variable for setting a default docker image #58
- Improve
trains-init
configuration wizard - Update examples for new joblib versions
- Update jupyter example to TensorFlow 2
- Fix task clone to copy only input artifacts
- Fix matplotlib import binding when using
Agg
backend - Fix
ProxyDictPreWrite
andProxyDictPostWrite
so they can be pickled correctly #72 - Fix requests issue in Python 2.7 that can cause a deadlock when importing netrc
- Fix argparser binding sub-parser and type casting support #74
- Fix argparser binding python 2.7 unicode handling
- Fix unsynced connected hyper parameters overridden during remote execution
Assets
3
Features and Bug Fixes
- Add
default_output_uri
feature #57 - Improve
trains-init
configuration wizard - Improve argparser binding
- Fix artifact support in data pipeline #63
- Fix threading issue while querying multiple experiments #64
- Fix uploading large files over slow HTTP connections
- Fix support for Git versions < 2
Breaking Changes
- Do not reuse an experiment with artifacts
Assets
3
Features and Bug Fixes
- Add
Task.connect_configuration()
, connect your configuration file (or configuration dictionary including nesting support). See example. - Add
Task.connect_label_enumeration()
, store model detection classes together with the model & experiment - Edit & override configuration file (or dictionary) content when executing with trains-agent
Task.connect()
automatically supports nested dictionaries (e.g.param/nested/key
) , including overriding values when running with trains-agent. See example.- Add
Artifact.get()
, download & load an artifact. - Add
Task.get_tasks()
, retrieve previous experiments, get metrics, rename etc. - Improve Windows support
- Improve Minio support
- Fix Python2 support
- Fix Issue #56
Assets
3
allegroai-git released this
Features and Bug Fixes
- TRAINS Agent support (Full DevOps support)
- Add AutoML and orchestration examples
Supports TRAINS-server v0.12 or above - Add Logger x/y/z axis title for: report_surface / report_confusion_matrix / report_scatter3d / report_scatter2d / report_histogram
- Add support for TensorFlow 2.0
- Embed pyhocon into package
- Fix artifacts support on Windows
- Fix example code Windows support
Assets
3
Features and Bug Fixes
- Resource-Monitor will only monitor active GPU devices
based on environment variables: NVIDIA_VISIBLE_DEVICES or CUDA_VISIBLE_DEVICES - Fix issue #48
Assets
3
Features and Bug Fixes
- Fix Python 2.7 support
- Improve sample code Windows support
Assets
3
Features and Bug Fixes
- GPU Monitoring is now embedded into trains (removed gpustat dependency)
- Add initial support for Tensorflow v2.0 (tested with v2.0.0rc1)
- Add artifact upload retry on network errors (default: 3)
- Suppress urllib3 retry warnings
- Fix Matplotlib support with Agg backend (multiple plot windows caused repeated graphs to be sent)
- Fix support for tuples in hyper-parameters
- Fix multi processing issues with different task types
Assets
3
allegroai-git released this
Features and Bug Fixes
- Full artifacts support (supported by trains-server >= 0.11.0)
- Artifacts include, Pandas.DataFrame, Numpy, PIL Image, local files, and local folder/wildcard (example)
- Artifacts support for folder/wildcard, selected files will be zipped and uploaded
- Resource monitoring, remove sensor reading failure warnings
Breaking Changes
- Logger
info
/error
/warning
/console
functions were removed, useLogger.report_text
(or python logging or print instead) - Tensorboard scalars are not grouped into one graph, but are stored on individual graphs (to match Tensorboard behavior). To restore previous behavior call
Logger.tensorboard_auto_group_scalars(group_scalars=True)
Assets
3
Features and Bug Fixes
- Artifacts support
- Removed apache-libcloud from requirements
trains-init
now verifies credentials against the trains-server installation
Assets
3
Features and Bug Fixes
- Fix broken (v0.10.5) Keras Binding support
Assets
3
allegroai-git released this
Features and Bug Fixes
- Replace opencv-python with the more standard Pillow package
- Improve matplotlib support (custom axis ticks)
- Improve python package detection
Assets
3
allegroai-git released this
Features and Bug Fixes
Assets
3
allegroai-git released this
Features and Bug Fixes
- Add Matplotlib SVG support
- Add Seaborn support
- Add TRAINS_LOG_ENVIRONMENT environment logging feature
- Add Microsoft Azure notebook support
- Add Google Colab support
- Fix Tensorboard RGB channel order
Assets
3
Features and Bug Fixes
- Fix Jenkins CI/CD support
Assets
3
-
Experiment code execution detection
- Automatically create package requirements section (including used versions)
- Automatically detect and store source code uncommitted changes
- Jupyter notebook support, automatically convert notebook to python script (stored under uncommitted changes)
- Jupyter notebook support, automatically update used packages in notebook (including used versions)
-
Add resource monitoring to experiment metrics
- Sampled every 500ms, averaged over 30 seconds
- CPU / Network / IO / Memory etc.
- For GPU support please install gpustat
(currently not part of the requirements due to gpustat compatibility issues with Windows)
$ pip install gpustat
-
Automatically stop inactive experiments (default: 2 hours)
-
Improved visibility
- Finer status definitions: Identify successful completion vs. user aborted
- Experiment plot comparison: Ensure different colors for different experiments
- Parse newline character in experiment description
- Show experiment start time in table display
- Add vertical guide in scalar plots
- Move Hyper-parameters to designated tab
- "Admin" section now named "Profile"
Assets
3
allegroai-git released this
Features and Bug Fixes
- Improved Jupyter and inline Matplotlib support
- Allow for insecure SSL connection to trains-server (use with care!)
- Automatically announce new trains version available (at least until we hit version 1.0)
- Fix support for local git branches (i.e. no matching remote branch)
- Verified support for Official TensorFlow 1.14 release and PyTorch with TensorBoard
Assets
3
Watchers:68 |
Star:2193 |
Fork:331 |
创建时间: 2019-06-10 16:18:32 |
最后Commits: 4天前 |
许可协议:Apache-2.0 |
34c41cf
Compare
Features and Bug Fixes
HyperParameterOptimizer
parallel coordinates #279Task.init()
argumenttags
HyperParameterOptimizer
spawn_task_project
tospawn_project
nargs
support was brokenaction="append"
file://
links