| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| Removed PromptProcessor, modularized training loop | Volpeon | 2023-01-13 | 9 | -293/+334 |
| Added TI decay start offset | Volpeon | 2023-01-13 | 2 | -3/+9 |
| Code deduplication | Volpeon | 2023-01-13 | 6 | -146/+149 |
| Update | Volpeon | 2023-01-12 | 3 | -34/+63 |
| Fixed TI decay | Volpeon | 2023-01-12 | 2 | -9/+12 |
| Disable Adam weight decay | Volpeon | 2023-01-12 | 1 | -1/+1 |
| Fix | Volpeon | 2023-01-11 | 2 | -5/+5 |
| Heck | Volpeon | 2023-01-11 | 1 | -1/+1 |
| TI: Use grad clipping from LoRA #104 | Volpeon | 2023-01-11 | 4 | -12/+15 |
| Better defaults | Volpeon | 2023-01-10 | 2 | -7/+6 |
| Fix | Volpeon | 2023-01-10 | 2 | -2/+2 |
| Added arg to disable tag shuffling | Volpeon | 2023-01-10 | 4 | -18/+37 |
| Enable buckets for validation, fixed vaildation repeat arg | Volpeon | 2023-01-09 | 3 | -10/+5 |
| Add --valid_set_repeat | Volpeon | 2023-01-09 | 3 | -1/+37 |
| No cache after all | Volpeon | 2023-01-08 | 1 | -17/+7 |
| Cache token IDs in dataset | Volpeon | 2023-01-08 | 1 | -8/+20 |
| Fix | Volpeon | 2023-01-08 | 2 | -6/+6 |
| Improved aspect ratio bucketing | Volpeon | 2023-01-08 | 3 | -4/+61 |
| Fixed aspect ratio bucketing | Volpeon | 2023-01-08 | 1 | -3/+5 |
| Cleanup | Volpeon | 2023-01-08 | 1 | -15/+25 |
| Fixed aspect ratio bucketing; allow passing token IDs to pipeline | Volpeon | 2023-01-08 | 5 | -68/+102 |
| Improved aspect ratio bucketing | Volpeon | 2023-01-08 | 4 | -223/+237 |
| Cleanup | Volpeon | 2023-01-07 | 3 | -178/+201 |
| Made aspect ratio bucketing configurable | Volpeon | 2023-01-07 | 3 | -18/+37 |
| Added progressive aspect ratio bucketing | Volpeon | 2023-01-07 | 5 | -126/+151 |
| Update | Volpeon | 2023-01-07 | 3 | -11/+6 |
| Fix | Volpeon | 2023-01-06 | 1 | -2/+4 |
| Relaxed ema_power for TI | Volpeon | 2023-01-06 | 1 | -1/+1 |
| Update | Volpeon | 2023-01-06 | 4 | -33/+26 |
| Use context manager for EMA, on_train/eval hooks | Volpeon | 2023-01-06 | 3 | -81/+92 |
| Package update | Volpeon | 2023-01-06 | 1 | -1/+1 |
| Add prompt template argument to inference | Volpeon | 2023-01-06 | 1 | -0/+7 |
| Add contextmanager to EMAModel to apply weights temporarily | Volpeon | 2023-01-06 | 2 | -33/+36 |
| Log EMA decay | Volpeon | 2023-01-05 | 1 | -1/+2 |
| Added EMA to TI | Volpeon | 2023-01-05 | 4 | -10/+157 |
| Fix LR finder | Volpeon | 2023-01-05 | 1 | -7/+23 |
| Update | Volpeon | 2023-01-05 | 1 | -1/+1 |
| Update | Volpeon | 2023-01-05 | 7 | -51/+75 |
| Fix | Volpeon | 2023-01-05 | 2 | -10/+13 |
| Various cleanups | Volpeon | 2023-01-05 | 8 | -1091/+133 |
| Update | Volpeon | 2023-01-04 | 6 | -9/+15 |
| Various updates | Volpeon | 2023-01-04 | 5 | -34/+87 |
| Better eval generator | Volpeon | 2023-01-04 | 3 | -13/+13 |
| Fixed reproducibility, more consistant validation | Volpeon | 2023-01-04 | 5 | -43/+113 |
| Don't use vector_dropout by default | Volpeon | 2023-01-03 | 2 | -2/+2 |
| Added vector dropout | Volpeon | 2023-01-03 | 4 | -15/+69 |
| Fixed LR finder | Volpeon | 2023-01-02 | 3 | -22/+26 |
| Update | Volpeon | 2023-01-02 | 3 | -9/+55 |
| Fix | Volpeon | 2023-01-02 | 2 | -8/+2 |
| Save args before training, too | Volpeon | 2023-01-02 | 2 | -0/+8 |