Posts
1589
Following
139
Followers
883
I'm a bit of an eclectic mess ๐Ÿ™‚ I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel which is complete, but is in the edit stage. I wrote my first novel over 20 years ago but then didn't write much till now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: ๐Ÿ‡ฑ๐Ÿ‡ฐ๐Ÿ‡ธ๐Ÿ‡ฆ๐Ÿ‡บ๐Ÿ‡ธ๐Ÿ‡ณ๐Ÿ‡ฟ๐Ÿ‡ธ๐Ÿ‡ฌ๐Ÿ‡ฒ๐Ÿ‡พ๐Ÿ‡ฆ๐Ÿ‡ช๐Ÿ‡ซ๐Ÿ‡ท๐Ÿ‡ช๐Ÿ‡ธ๐Ÿ‡ต๐Ÿ‡น๐Ÿ‡ถ๐Ÿ‡ฆ๐Ÿ‡จ๐Ÿ‡ฆ

Fahim Farook

And with 11 papers boosted out of 82 in the cs.CV category on arXiv, we are done with papers for the day ๐Ÿ™‚

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

Fahim Farook

"DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model. (arXiv:2302.06908v1 [cs.CV])" โ€” Today seems to be the day for papers on turning sketches into realistic colour images using diffusion models ๐Ÿ˜› This is another approach ...

Paper: http://arxiv.org/abs/2302.06908

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
A Sketch-Guided Lantent Diffusiโ€ฆ
0
1
1

Fahim Farook

"Text-Guided Scene Sketch-to-Photo Synthesis. (arXiv:2302.06883v1 [cs.CV])" โ€” Creating a whole scene color image based on a sketch using generative models such as Stable Diffusion.

Paper: http://arxiv.org/abs/2302.06883

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Overview of training and sampliโ€ฆ
0
1
0

Fahim Farook

"Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation. (arXiv:2302.06857v1 [cs.CV])" โ€” Creating photorealistic images of people based on sketches using 3D generative models.

Paper: http://arxiv.org/abs/2302.06857

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
The visualization of our Stereoโ€ฆ
0
1
0

Fahim Farook

"DiffFashion: Reference-based Fashion Design with Structure-aware Transfer by Diffusion Models. (arXiv:2302.06826v1 [cs.CV])" โ€” Transfer the input image appearance onto images of items of clothing while not altering the structure of the clothing item.

Paper: http://arxiv.org/abs/2302.06826
Code: https://github.com/Rem105-210/DiffFashion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Two examples of a reference-basโ€ฆ
0
1
0

Fahim Farook

"Multiple Appropriate Facial Reaction Generation in Dyadic Interaction Settings: What, Why and How?. (arXiv:2302.06514v2 [cs.CV] UPDATED)" โ€” An attempt to generate appropriate behavioral responses to received stimulus and to evaluate the appropriateness of the generated responses.

Paper: http://arxiv.org/abs/2302.06514

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Multiple appropriate reaction gโ€ฆ
0
1
0

Fahim Farook

"NYCU-TWO at Memotion 3: Good Foundation, Good Teacher, then you have Good Meme Analysis. (arXiv:2302.06078v2 [cs.CL] UPDATED)" โ€” Classifying the emotions and intensity expressed in memes using machine learning.

Paper: http://arxiv.org/abs/2302.06078

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Overview of the Meme Encoder
0
1
0

Fahim Farook

"Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation. (arXiv:2208.09141v2 [cs.CV] UPDATED)" โ€” Using diffusion models to generate sign language pose sequences based on spoken language.

Paper: http://arxiv.org/abs/2208.09141

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
The forward diffusion process aโ€ฆ
0
1
0

Fahim Farook

"ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models. (arXiv:2302.07257v1 [cs.CV])" โ€” Using Large Language Models (LLM) with Computer-Aided Diagnosis (CAD) networks to enhance the output of CAD networks by summarizing and presenting the information in a more understandable format.

Paper: http://arxiv.org/abs/2302.07257

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Overview of our proposed strateโ€ฆ
0
0
0

Fahim Farook

"Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask. (arXiv:2302.07224v1 [cs.CV])" โ€” Using a semantic mask as input to generate photorealistic color images of natural scenes.

Paper: http://arxiv.org/abs/2302.07224

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Given only a single semantic maโ€ฆ
0
1
0

Fahim Farook

"Netizens, Academicians, and Information Professionals' Opinions About AI With Special Reference To ChatGPT. (arXiv:2302.07136v1 [cs.CY])" โ€” It's interesting to see what people think about ChatGPT (and AI in general) given all the current rush to embrace ChatGPT as the saviour of many businesses/corporations ... ๐Ÿ˜› But a little light on specifics โ€” mostly general statistics/impressions.

Paper: http://arxiv.org/abs/2302.07136

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
0
2
0

Fahim Farook

"Universal Guidance for Diffusion Models. (arXiv:2302.07121v1 [cs.CV])" โ€” a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance methods such as segmentation, face recognition, object detection, and classifier signals, without the need to retrain for that specific method.

Paper: http://arxiv.org/abs/2302.07121
Code: https://github.com/arpitbansal297/Universal-Guided-Diffusion

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too ๐Ÿ™‚>>
Examples of diffusion being guiโ€ฆ
0
1
2

Fahim Farook

A total of 82 papers in the cs.CV category on arXiv.org today โ€” 47 new, 35 updated.

Have CoreML models I want to play with, but I guess Iโ€™ll have to read papers instead ๐Ÿ˜›

#AI #CV #NewPapers #DeepLearning #MachineLearning
0
0
0

Fahim Farook

Edited 2 years ago
Since yesterday evening, when I finally perfected the process for converting existing #StableDiffusion models (in either CKPT or SAFETENSOR format) to CoreML format, Iโ€™ve converted six models. Of them, all except for one work fine ๐Ÿ™‚

So, in case this helps anybody else, here are the important things to remember:

1. You *must* use Python 3.8. If you any other Python version, you will end up with errors. (Not that Iโ€™ve tested all Python versions, but I did have errors with Python 3.9 and have read reports of others โ€ฆ)

2. You should be on Ventura 13.1 or higher.

3. You need the models to be in Diffusers format to run the conversion, but the easiest way that has worked for me is to download a CKPT file, convert it to Diffusers and point the script at the local folder with the Diffusers format model.

4. HuggingFace folks have a bunch of conversion scripts here: https://github.com/huggingface/diffusers/tree/main/scripts

5. The above scripts donโ€™t mention SAFETENSOR format in the file names but SAFETENSOR is just CKPT with some changes. The CKPT conversion file has an extra argument named โ€œ--from_safetensorsโ€ so you can use the same script for CKPT to convert SAFETENSOR files with that extra argument.

6. You can use the Apple conversion script to convert one element at at a time using the different arguments such as โ€œ--convert-unetโ€, โ€œ--convert-text-encoderโ€ etc. You donโ€™t have to run all of them together. In fact, it turned out when I ran them all together, sometimes a component might be left out โ€” generally the text encoder.

7. Once youโ€™ve converted all the components and have them in one folder, you have to run the Apple conversion script once more with the โ€œ--bundle-resources-for-swift-cliโ€ argument (pointing at your output folder) to create the final compiled CoreML model files (.mlmodelc) from your .mlpackage files.

Thatโ€™s it ๐Ÿ™‚ If you do all of the above, it should be fairly straightforward to create new CoreML models from existing StableDiffusion models.

Feel free to hit me up should you run into issues. Since Iโ€™ve gone through all this, Iโ€™d be happy to help anybody else facing the same issues โ€ฆ.

#CoreML #StableDiffusion #MachineLearning #DeepLearning #ModelConversion
0
1
1

Sounds awesome, please let us see it Joe.
https://patreon.com/lowqualityfacts

3
2
0

I get knocked down, but I get up again briefly, complain a lot, then lie back down voluntarily

1
3
0

This ancient goat willow is half-tree, half-octopus!

It may have been broken by a Cumbrian storm, but is still very much alive - and supporting lots of other life too. One of my favourite trees ๐Ÿ’š

1
4
0
Show older