- Machine Learning With Go
- Daniel Whitenack
- 205字
- 2021-07-08 10:37:29
Creating data repositories for data versioning
If you followed the local installation of Pachyderm specified in the Pachyderm documentation, you should have the following:
- Kubernetes running in a Minikube VM on your machine
- The pachctl command line tool installed and connected to your Pachyderm cluster
Of course, if you have a production cluster running in a cloud, the following steps still apply. Your pachctl would just be connected to the remote cluster.
We will be demonstrating data versioning functionality with the pachctl Command-line Interface ( CLI) tool below (which is a Go program). However, as mentioned above, Pachyderm has a full-fledged Go client. You can create repositories, commit data, and much more directly from your Go programs. This functionality will be demonstrated later in Chapter 9, Deploying and distributing Analyses and Models.
To create a repository of data called myrepo, you can run this code:
$ pachctl create-repo myrepo
You can then confirm that the repository exists with list-repo:
$ pachctl list-repo
NAME CREATED SIZE
myrepo 2 seconds ago 0 B
This myrepo repository is a collection of data that we have defined and is ready for housing-versioned data. Right now, there is no data in the repository, because we haven't put any data there yet.
推薦閱讀
- Mastering RabbitMQ
- LabVIEW Graphical Programming Cookbook
- 自己動手實現Lua:虛擬機、編譯器和標準庫
- MATLAB圖像處理超級學習手冊
- Java FX應用開發教程
- 我的第一本算法書
- 信息安全技術
- 數據結構(C語言)
- SQL Server 2016數據庫應用與開發
- 零基礎學Python網絡爬蟲案例實戰全流程詳解(入門與提高篇)
- Mastering Business Intelligence with MicroStrategy
- Oracle GoldenGate 12c Implementer's Guide
- 小程序從0到1:微信全棧工程師一本通
- 多媒體技術及應用
- Scratch編程從入門到精通