Posts: 1368
Following: 141
Followers: 868
I'm a bit of an eclectic mess 🙂 I've been a programmer, journalist, editor, TV producer, and a few other things.

I'm currently working on my second novel, which is complete but still in the editing stage. I wrote my first novel over 20 years ago but then didn't write much until now.

I post about #Coding, #Flutter, #Writing, #Movies and #TV. I'll also talk about #Technology, #Gadgets, #MachineLearning, #DeepLearning and a few other things as the fancy strikes ...

Lived in: 🇱🇰🇸🇦🇺🇸🇳🇿🇸🇬🇲🇾🇦🇪🇫🇷🇪🇸🇵🇹🇶🇦🇨🇦

Fahim Farook

"Scalable and Equivariant Spherical CNNs by Discrete-Continuous (DISCO) Convolutions. (arXiv:2209.13603v3 [cs.CV] UPDATED)" - A hybrid discrete-continuous (DISCO) group convolution for spherical convolutional neural networks (CNN) that is simultaneously equivariant and computationally scalable to high resolution.

Paper: http://arxiv.org/abs/2209.13603

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Spherical CNN categorization

Fahim Farook

"Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos. (arXiv:2109.08275v2 [cs.MM] UPDATED)" - A tourist attraction recommendation system based on geo-tagged photos, which uses the visual contents of photos and interaction behavior data to obtain the final embeddings of users and tourist attractions; these embeddings are then used to predict visit probabilities.

Paper: http://arxiv.org/abs/2109.08275
Code: https://github.com/revaludo/MEAL

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
An illustration of the multi-le…

Fahim Farook

"BiAdam: Fast Adaptive Bilevel Optimization Methods. (arXiv:2106.11396v3 [math.OC] UPDATED)" - A novel fast adaptive bilevel framework for solving stochastic bilevel optimization problems in which the outer problem is possibly nonconvex and the inner problem is strongly convex.

Paper: http://arxiv.org/abs/2106.11396

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The basic idea of the convergen…

Fahim Farook

"Sparse Oblique Decision Trees: A Tool to Understand and Manipulate Neural Net Features. (arXiv:2104.02922v2 [cs.LG] UPDATED)" - An effort to understand which of the internal features computed by a neural net are responsible for a particular class, by mimicking part of the neural net with an oblique decision tree that has sparse weight vectors at the decision nodes.

Paper: http://arxiv.org/abs/2104.02922

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Mimicking part of a neural net …

Fahim Farook

"Don't Play Favorites: Minority Guidance for Diffusion Models. (arXiv:2301.12334v1 [cs.LG])" - A framework that makes the generation process of diffusion models focus on minority samples, which are instances that lie on low-density regions of a data manifold.

Paper: http://arxiv.org/abs/2301.12334

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Diffusion models play favorites…

Fahim Farook

"SEGA: Instructing Diffusion using Semantic Dimensions. (arXiv:2301.12247v1 [cs.CV])" - A semantic guidance method for diffusion models that allows subtle and extensive edits, changes in composition and style, and optimization of the overall artistic conception.

Paper: http://arxiv.org/abs/2301.12247

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Semantic control over image gen…

Fahim Farook

"Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits. (arXiv:2301.12168v1 [cs.LG])" - A new training technique based on weighted ensembles of early exits, which aims to exploit the information in the structure of networks to maximise their performance.

Paper: http://arxiv.org/abs/2301.12168

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Outline of the AEP technique. O…

Fahim Farook

"ClusterFuG: Clustering Fully connected Graphs by Multicut. (arXiv:2301.12159v1 [cs.CV])" - A simpler and potentially better-performing graph clustering formulation based on multicut (a.k.a. weighted correlation clustering) on the complete graph.

Paper: http://arxiv.org/abs/2301.12159

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Example illustration of dense m…

Fahim Farook

"Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) Dataset. (arXiv:2301.12073v1 [cs.CV])" - A culturally-aware priming approach for text-to-image synthesis that uses a small but culturally curated dataset to fight the bias prevalent in giant datasets.

Paper: http://arxiv.org/abs/2301.12073

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Sample images generated for fiv…

Fahim Farook

"Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning. (arXiv:2301.12025v1 [cs.CV])" - A novel self-supervised learning approach that leverages a Transformer and a CNN simultaneously. It aims to overcome two issues with existing self-supervised techniques: extreme computational requirements, and a substantial drop in performance when the batch size or the number of pretraining epochs is reduced.

Paper: http://arxiv.org/abs/2301.12025

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
In our proposed self-supervised…

Fahim Farook

"Improved knowledge distillation by utilizing backward pass knowledge in neural networks. (arXiv:2301.12006v1 [cs.LG])" - Addresses an issue with Knowledge Distillation (KD): there is no guarantee that the student will match the teacher in areas for which there are not enough training samples. The approach generates new auxiliary training samples by extracting knowledge from the backward pass of the teacher in the areas where the student diverges greatly from the teacher.

Paper: http://arxiv.org/abs/2301.12006

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
a) Minimization Step: Using the…

Fahim Farook

"RGB Arabic Alphabets Sign Language Dataset. (arXiv:2301.11932v1 [cs.CV])" - An Arabic Alphabet Sign Language (AASL) dataset comprising 7,856 raw and fully labelled RGB images of the Arabic sign language alphabets, which might be the first such publicly available dataset.

Paper: http://arxiv.org/abs/2301.11932

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Sample images from the dataset

Fahim Farook

"Input Perturbation Reduces Exposure Bias in Diffusion Models. (arXiv:2301.11706v1 [cs.LG])" - An exploration of the fact that the long sampling chain in Denoising Diffusion Probabilistic Models (DDPM) leads to an error accumulation phenomenon, which is similar to the exposure bias problem in autoregressive text generation.

Paper: http://arxiv.org/abs/2301.11706

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
The inputs and the prediction t…

Fahim Farook

"Image Restoration with Mean-Reverting Stochastic Differential Equations. (arXiv:2301.11699v1 [cs.LG])" - A stochastic differential equation (SDE) approach for general-purpose image restoration, which can restore images without relying on any task-specific prior knowledge.

Paper: http://arxiv.org/abs/2301.11699

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
An overview of our proposed con…

Fahim Farook

"Accelerating Guided Diffusion Sampling with Splitting Numerical Methods. (arXiv:2301.11558v1 [cs.CV])" - A solution for speeding up guided diffusion image generation based on operator splitting methods, motivated by the finding that classical high-order numerical methods are unsuitable for the conditional function.

Paper: http://arxiv.org/abs/2301.11558

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Generated samples of a classifi…

Fahim Farook

"3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models. (arXiv:2301.11445v1 [cs.CV])" - A novel shape representation for neural fields, designed for generative diffusion models, which encodes 3D shapes given as surface models or point clouds and represents them as neural fields.

Paper: http://arxiv.org/abs/2301.11445

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Left: Shape autoencoding result…

Fahim Farook

"Improving Cross-modal Alignment for Text-Guided Image Inpainting. (arXiv:2301.11362v1 [cs.CV])" - A model for text-guided image inpainting that improves cross-modal alignment (CMA) using cross-modal alignment distillation and in-sample distribution distillation.

Paper: http://arxiv.org/abs/2301.11362

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Different categories of vision-…

Fahim Farook

"Rethinking 1x1 Convolutions: Can we train CNNs with Frozen Random Filters? (arXiv:2301.11360v1 [cs.CV])" - An exploration into whether it is really necessary for Convolutional Neural Networks (CNNs) to learn the weights of vast numbers of convolutional operators.

Paper: http://arxiv.org/abs/2301.11360

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Validation accuracy of LCResNet…

Fahim Farook

"Multimodal Event Transformer for Image-guided Story Ending Generation. (arXiv:2301.11357v1 [cs.CV])" - A multimodal event transformer: an event-based reasoning framework for image-guided story ending generation, which constructs visual and semantic event graphs from the story plot and the ending image, and uses event-based reasoning to mine implicit information in a single modality.

Paper: http://arxiv.org/abs/2301.11357

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Given a multi-sentence story pl…

Fahim Farook

"Animating Still Images. (arXiv:2209.10497v2 [cs.CV] UPDATED)" - A method for imparting motion to a still 2D image, which uses deep learning to segment part of the image as the subject, uses inpainting to complete the background, and then adds animation to the subject.

Paper: http://arxiv.org/abs/2209.10497

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Interactive segmentation: Green…