summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* UpdateVolpeon2023-01-123-34/+63
|
* Fixed TI decayVolpeon2023-01-122-9/+12
|
* Disable Adam weight decayVolpeon2023-01-121-1/+1
|
* FixVolpeon2023-01-112-5/+5
|
* HeckVolpeon2023-01-111-1/+1
|
* TI: Use grad clipping from LoRA #104Volpeon2023-01-114-12/+15
|
* Better defaultsVolpeon2023-01-102-7/+6
|
* FixVolpeon2023-01-102-2/+2
|
* Added arg to disable tag shufflingVolpeon2023-01-104-18/+37
|
* Enable buckets for validation, fixed vaildation repeat argVolpeon2023-01-093-10/+5
|
* Add --valid_set_repeatVolpeon2023-01-093-1/+37
|
* No cache after allVolpeon2023-01-081-17/+7
|
* Cache token IDs in datasetVolpeon2023-01-081-8/+20
|
* FixVolpeon2023-01-082-6/+6
|
* Improved aspect ratio bucketingVolpeon2023-01-083-4/+61
|
* Fixed aspect ratio bucketingVolpeon2023-01-081-3/+5
|
* CleanupVolpeon2023-01-081-15/+25
|
* Fixed aspect ratio bucketing; allow passing token IDs to pipelineVolpeon2023-01-085-68/+102
|
* Improved aspect ratio bucketingVolpeon2023-01-084-223/+237
|
* CleanupVolpeon2023-01-073-178/+201
|
* Made aspect ratio bucketing configurableVolpeon2023-01-073-18/+37
|
* Added progressive aspect ratio bucketingVolpeon2023-01-075-126/+151
|
* UpdateVolpeon2023-01-073-11/+6
|
* FixVolpeon2023-01-061-2/+4
|
* Relaxed ema_power for TIVolpeon2023-01-061-1/+1
|
* UpdateVolpeon2023-01-064-33/+26
|
* Use context manager for EMA, on_train/eval hooksVolpeon2023-01-063-81/+92
|
* Package updateVolpeon2023-01-061-1/+1
|
* Add prompt template argument to inferenceVolpeon2023-01-061-0/+7
|
* Add contextmanager to EMAModel to apply weights temporarilyVolpeon2023-01-062-33/+36
|
* Log EMA decayVolpeon2023-01-051-1/+2
|
* Added EMA to TIVolpeon2023-01-054-10/+157
|
* Fix LR finderVolpeon2023-01-051-7/+23
|
* UpdateVolpeon2023-01-051-1/+1
|
* UpdateVolpeon2023-01-057-51/+75
|
* FixVolpeon2023-01-052-10/+13
|
* Various cleanupsVolpeon2023-01-058-1091/+133
|
* UpdateVolpeon2023-01-046-9/+15
|
* Various updatesVolpeon2023-01-045-34/+87
|
* Better eval generatorVolpeon2023-01-043-13/+13
|
* Fixed reproducibility, more consistant validationVolpeon2023-01-045-43/+113
|
* Don't use vector_dropout by defaultVolpeon2023-01-032-2/+2
|
* Added vector dropoutVolpeon2023-01-034-15/+69
|
* Fixed LR finderVolpeon2023-01-023-22/+26
|
* UpdateVolpeon2023-01-023-9/+55
|
* FixVolpeon2023-01-022-8/+2
|
* Save args before training, tooVolpeon2023-01-022-0/+8
|
* FixVolpeon2023-01-022-4/+2
|
* UpdateVolpeon2023-01-022-13/+12
|
* FixVolpeon2023-01-021-1/+1
|