Building a State-Of-The-Art Card Fraud Detection System in 9 Months

NOTE: This article was originally published in the Revolut Tech publication on Medium on November 14th, 2019.

Have you ever experienced the dismay of getting a call from your bank’s security team? The voice on the other end of the line reports that they detected a suspicious transaction.

“Please confirm that it’s you making a payment for $149.99.”

You start nervously recalling your latest purchases:

“Am I losing track of the things I buy, or is someone really trying to rob me?”

Your palms are sweating. It’s not you, so you ask your bank to cancel the payment. For the next few hours you’re overwhelmed with uneasy thoughts:

“How did the perpetrator get my payment data? Is it going to happen again? What did they buy?”

It’s irritating, to say the least, and I’ve been there myself. That’s why when I received an offer to build a fraud prevention solution at Revolut, I was happy to embrace the opportunity. What’s more, I was truly excited about solving this business problem using supervised machine learning methods.

In this article, I'll cover:

  • the fraud prevention system we've created,
  • the technologies we used,
  • the results we've achieved so far,
  • and where we're heading.

I’ll lead you through all the stages of the system creation, from definition to deployment. But first things first.

Meet Sherlock

Sherlock is our card fraud prevention system based on machine learning. It continuously and autonomously monitors Revolut users' transactions. While a web store or a terminal at Starbucks displays the "Processing Payment" animation, Sherlock evaluates your transaction in under 50 ms.

If Sherlock finds it suspicious, it blocks the purchase and freezes your card. A follow-up push notification prompts you to confirm whether the payment was fraudulent. If you respond that it was legit, the card is unblocked, and you simply repeat the purchase. If, however, you don't recognise the transaction, your card gets terminated, and you can order a free replacement.

Sherlock in action

Fraudsters exploit advanced technologies and learn fast, but so does Sherlock. Every night, Sherlock's machine learning models are retrained to account for any missed fraudulent and incorrectly declined transactions.

Sherlock in numbers:

  • over $3M saved during the year in production
  • just 1c out of $100 is lost due to fraud
  • 96% of fraudulent transactions are caught
  • 30% of Sherlock’s fraud predictions turn out to be correct

For our clients, these numbers make the crucial difference between an unforgettable holiday and one marred by being robbed and having to make do in a foreign country.

Defining project goals, metrics, and talent

Above all, we aimed to minimise Revolut's fraud losses and the losses caused by incorrectly blocked transactions. Banks have dedicated transaction-analysis departments with hundreds of people calling clients to prevent unauthorised transactions.

It takes a huge effort to reduce the damage caused by scammers. We wanted to find a way to detect and prevent fraudulent card transactions efficiently, with fewer resources. We needed to learn to predict which transactions a user might later charge back. While diving deeper into the problem, we realised the importance of catching the first instance in a sequence of fraudulent transactions.

Basically, we're solving a binary classification problem here: identifying whether a given transaction is fraudulent or not. Thus, precision, recall, and the number of false-positive detections are our primary metrics. Adding metrics for detecting the first fraud in a sequence helped us tune the model's performance during training.
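
To make these metrics concrete, here's a minimal sketch (using scikit-learn and made-up labels and scores) of how precision, recall, and the false-positive count fall out of a chosen probability threshold:

import numpy as np
from sklearn.metrics import precision_score, recall_score

# Hypothetical ground-truth labels (1 = fraud) and model scores for a batch of transactions
y_true = np.array([0, 0, 1, 0, 1, 0, 0, 0, 1, 0])
scores = np.array([0.02, 0.10, 0.91, 0.05, 0.40, 0.01, 0.30, 0.03, 0.85, 0.07])

threshold = 0.5                      # transactions scoring above this get declined
y_pred = (scores >= threshold).astype(int)

precision = precision_score(y_true, y_pred)   # share of declined transactions that were truly fraud
recall = recall_score(y_true, y_pred)         # share of fraud that we actually caught
false_positives = int(((y_pred == 1) & (y_true == 0)).sum())

print(f"precision={precision:.2f}, recall={recall:.2f}, false positives={false_positives}")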

In the early days of our research, we needed an expert to perform data wrangling, data analysis, data visualisation, and machine learning. I took on the responsibilities of a data scientist and a backend engineer to build Sherlock, and later enlisted the help of my colleagues (mobile developers and platform engineers) to integrate it into the Revolut product and develop the UI.

Discovery. Data sources

Having the right data is much more important than choosing a machine learning algorithm. At Revolut, we've kept a table of transactions reported as fraudulent since the early days. It was stored in a PostgreSQL database and wasn't well suited for analysis.

To simplify data processing and visualisation, I set up a nightly delta dump from PostgreSQL into BigQuery, a data warehouse running on Google Cloud. It became easier to join multiple tables and perform an initial analysis of the millions of transactions happening daily. Besides, the serverless nature of BigQuery relieved me of the need to maintain infrastructure; my focus was on the ETL code only.
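
To give a feel for that ETL code, here's a simplified sketch of a nightly delta load. The connection details, table names, and the `updated_at` watermark column are invented for the example, not our actual schema:

import datetime as dt
import pandas as pd
import psycopg2
from google.cloud import bigquery

bq = bigquery.Client()

def load_daily_delta(run_date: dt.date) -> None:
    # Pull only the rows that changed on the given day from PostgreSQL...
    with psycopg2.connect(host="db-host", dbname="revolut", user="sherlock") as conn:  # illustrative
        df = pd.read_sql(
            "SELECT * FROM card_transactions WHERE updated_at::date = %s",
            conn, params=(run_date,),
        )
    # ...and append them to the analytical table in BigQuery.
    job = bq.load_table_from_dataframe(
        df, "analytics.card_transactions",
        job_config=bigquery.LoadJobConfig(write_disposition="WRITE_APPEND"),
    )
    job.result()  # block until the load job finishes

load_daily_delta(dt.date.today() - dt.timedelta(days=1))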

Design. Envisioning the solution

Having the right data in hand, we moved on to envisioning the end solution. It was important to decide on the architecture at such an early stage of the project to make sure that our solution would fit the pace and conditions of real-time processing.

For the task of detecting fraudulent card transactions, we decided to implement a lambda architecture. We designed a real-time transaction scoring system paired with a nightly batch data processing pipeline. The nightly job fixes any inaccuracies that may have crept into user profiles throughout the day.

We also wanted to increase the accuracy of scoring and automate the fraud-prevention flow entirely. Therefore, we decided to collect users' feedback on declined suspicious transactions within the mobile app. Besides, this logic would empower our users to take control of their funds and make their own decision: they can either confirm a transaction and unblock their card, or quickly terminate their card and order a new one.

Development

Choosing the language

We chose Python for both development and production, as it's the most widely used language in the data science community.

Feature engineering

This is the most challenging and creative part of any machine learning project. At this stage, we investigated how we could use what we knew about customer behaviour and merchants to identify fraud patterns in real time, and what kinds of deviations should be considered risky.

Features are attributes that data scientists derive from the available data. They are combined into a vector that serves as input to a machine learning algorithm.

We categorised our features as follows (a small sketch of computing a few of them follows the list):

  1. Features with no or minimum pre-processing:
    - Merchant’s name, city, category, currency, etc.
    - USD amount of the transaction
    - Transaction time of day


  2. User-focused features that compare the values of the given transaction against the historical transaction data for the given user:
    - How quickly does the user make the transaction?
    - How much does the USD transaction amount differ from the average spending of this user with the given merchant and the category code, using this payment method, and at this time of day?
    - Is it the first transaction of the user with this merchant? If yes, this merchant is monitored for the next few hours


  3. Merchant-focused features that compare the current transaction with all the previous non-fraudulent and fraudulent transactions at this merchant or merchant category:
    - How many users transacted with this merchant before? How many of them reported fraud with this merchant?
    - How much and how often do users transact with this merchant?
    - How long ago did we first see this merchant at Revolut?
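
To make the categories above more tangible, here's a toy sketch of how a couple of the user-focused features might be computed from a user's transaction history. The field names and profile structure are illustrative, not Sherlock's actual schema:

from statistics import mean
from typing import Dict, List

def user_features(txn: Dict, history: List[Dict]) -> Dict:
    """A few illustrative user-focused features for one transaction.

    `history` holds the user's previous transactions in chronological order.
    """
    same_merchant = [t for t in history if t["merchant_id"] == txn["merchant_id"]]
    avg_amount = mean(t["amount_usd"] for t in same_merchant) if same_merchant else 0.0
    return {
        "amount_usd": txn["amount_usd"],
        "hour_of_day": txn["timestamp"].hour,
        # deviation from this user's average spend with this merchant
        "amount_vs_merchant_avg": txn["amount_usd"] - avg_amount,
        # is this the user's first transaction with this merchant?
        "first_time_with_merchant": not same_merchant,
        # seconds since the user's previous transaction, -1 if there is none
        "seconds_since_last_txn": (
            (txn["timestamp"] - history[-1]["timestamp"]).total_seconds() if history else -1.0
        ),
    }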


Getting to scale

During the early stages of my research, I experimented with feature engineering in a Jupyter Notebook. But at the scale of millions of users, such an approach wouldn't work.

Besides, I had to construct our training data in a way which would prevent data leakage. In other words, the features for every transaction should only be based on the previous transactions of a user when sorted chronologically. That way, we precisely emulate the production flow.

I turned to the Apache Beam framework, using Google Cloud Dataflow as the runner. Beam allowed me to focus only on the feature engineering logic for an individual user, leaving the parallelisation of the code and the hardware infrastructure to the framework.
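
Here's a stripped-down sketch of such a pipeline with the Beam Python SDK, reusing the `user_features` helper from the earlier sketch. The table names and pipeline options are illustrative, not the production job:

import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

def build_training_rows(element):
    user_id, txns = element
    txns = sorted(txns, key=lambda t: t["timestamp"])  # chronological order prevents data leakage
    for i, txn in enumerate(txns):
        # Features for a transaction may only look at the user's *previous* transactions.
        yield {**user_features(txn, txns[:i]), "is_fraud": txn["is_fraud"]}

options = PipelineOptions(runner="DataflowRunner", project="my-project", region="europe-west2")
with beam.Pipeline(options=options) as pipeline:
    (
        pipeline
        | "Read" >> beam.io.ReadFromBigQuery(table="my-project:analytics.card_transactions")
        | "KeyByUser" >> beam.Map(lambda t: (t["user_id"], t))
        | "GroupByUser" >> beam.GroupByKey()
        | "BuildFeatures" >> beam.FlatMap(build_training_rows)
        | "Write" >> beam.io.WriteToBigQuery(
            "my-project:analytics.training_data",
            create_disposition=beam.io.BigQueryDisposition.CREATE_NEVER,
        )
    )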

Choosing a machine learning algorithm

I experimented with a wide range of machine learning algorithms: linear regression using Vowpal Wabbit, gradient boosting using Catboost and XGBoost, scikit-learn implementations of random forest, SVM with linear and RBF kernels, one-class SVM, and neural networks using TensorFlow.

In the end, we chose Catboost, an open-source library from Yandex implementing gradient boosting on decision trees, as it outperformed the other algorithms on several metrics as well as inference speed. The algorithm has proved robust on heterogeneous data coming from different sources and containing both numerical features of varying nature and categorical features. Besides, it didn't require much hyperparameter tuning.
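
For illustration, a minimal Catboost training script looks roughly like this; the feature names, hyperparameters, and file paths are placeholders rather than the tuned production setup:

import pandas as pd
from catboost import CatBoostClassifier, Pool

train_df = pd.read_parquet("training_data.parquet")   # illustrative path
cat_features = ["merchant_name", "merchant_city", "merchant_category", "currency"]

train_pool = Pool(
    data=train_df.drop(columns=["is_fraud"]),
    label=train_df["is_fraud"],
    cat_features=cat_features,        # Catboost handles categorical columns natively
)

model = CatBoostClassifier(
    iterations=1000,                  # illustrative, not the production values
    learning_rate=0.05,
    eval_metric="AUC",
    verbose=100,
)
model.fit(train_pool)
model.save_model("sherlock.cbm")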

How I dealt with an imbalanced dataset

Training a machine learning model on a highly imbalanced dataset requires a particular approach. The fraud rate in the training data is around 0.03%. At first, I downsampled the non-fraudulent transactions, raising the fraud ratio to about 10%. Then, I assigned weights to individual fraudulent transactions based on the amount of the transaction and several other factors.

I constructed the validation data from all transactions in the following five weeks, kept in chronological order. It's essential to keep the fraud ratio in the validation dataset the same as in production. That way, we obtain performance results as close as possible to the ones we can expect during live scoring in production.
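
Here's a simplified sketch of that preparation, assuming a pandas DataFrame with `timestamp`, `amount_usd`, and `is_fraud` columns; the weighting formula is a stand-in for the real one:

import pandas as pd

df = pd.read_parquet("training_data.parquet").sort_values("timestamp")

# Chronological split: the last five weeks are held out for validation,
# keeping the natural (production-like) fraud ratio in the validation set.
cutoff = df["timestamp"].max() - pd.Timedelta(weeks=5)
train, valid = df[df["timestamp"] <= cutoff], df[df["timestamp"] > cutoff]

# Downsample non-fraud rows in the training set so fraud makes up roughly 10%.
fraud, legit = train[train["is_fraud"] == 1], train[train["is_fraud"] == 0]
legit_sampled = legit.sample(n=len(fraud) * 9, random_state=42)
train_balanced = pd.concat([fraud, legit_sampled]).sort_values("timestamp")

# Illustrative per-row weights: larger fraudulent transactions count more.
weights = 1.0 + train_balanced["is_fraud"] * (train_balanced["amount_usd"] / 100.0)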

Deployment. Our current production flow

Finally, in production, we had to take into account that a card fraud prevention system is a mission-critical application. Therefore, we had to implement specific measures to ensure the resilience and reliability of the service, keep the response latency within 50 ms, and make sure that all the important events were monitored and covered by alerts.

Let's take a look at our current production flow.

Production flow

It all starts with an offline nightly job orchestrated by Google Cloud Composer, a managed service for Apache Airflow. The job dumps the day's delta of transactions from our PostgreSQL database into BigQuery. Since the schema of some tables might have changed, the job updates the BigQuery schemas on the fly before loading the data.

After that, an Apache Beam job running on Google Cloud Dataflow generates the training data, which gets dumped back into BigQuery. At the same time, several Beam jobs create user and merchant profiles that are put into Couchbase, an in-memory NoSQL database.

The training data is then used by several machine learning jobs training Catboost models on the Google Cloud AI Platform. The trained models are stored in Google Cloud Storage.
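
To tie the nightly steps together, Composer mostly just needs the dependencies between them spelled out as an Airflow DAG. Here's a skeleton of what that can look like (Airflow 2-style imports; the task bodies and schedule are placeholders, not our production DAG):

from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

def dump_postgres_delta(**_): ...   # delta load from PostgreSQL into BigQuery
def build_training_data(**_): ...   # kicks off the Dataflow (Apache Beam) job
def refresh_profiles(**_): ...      # rebuilds user/merchant profiles in Couchbase
def train_models(**_): ...          # submits Catboost training to the AI Platform

with DAG("sherlock_nightly", start_date=datetime(2019, 1, 1),
         schedule_interval="0 2 * * *", catchup=False) as dag:
    dump = PythonOperator(task_id="dump_postgres_delta", python_callable=dump_postgres_delta)
    features = PythonOperator(task_id="build_training_data", python_callable=build_training_data)
    profiles = PythonOperator(task_id="refresh_profiles", python_callable=refresh_profiles)
    train = PythonOperator(task_id="train_models", python_callable=train_models)

    dump >> features >> train
    dump >> profiles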

What happens in real-time

When you make a card transaction, Revolut's processing backend sends it to the Sherlock Flask app deployed on Google Cloud App Engine. The trained machine learning models are pre-loaded into memory.

Upon receiving a transaction via an HTTP POST request, the Sherlock app fetches the corresponding user’s and merchant’s profiles from Couchbase. Then, it generates a feature vector — using the same features as the ones created in the Apache Beam job that produces the training data — and makes a prediction. The prediction is then sent in a JSON response to the processing backend where a corresponding action is taken — all within 50 ms.
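
Here's a pared-down sketch of what such a scoring endpoint can look like. The route, field names, and threshold are illustrative, and the Couchbase lookup and feature-building steps are stubbed out:

from flask import Flask, request, jsonify
from catboost import CatBoostClassifier

app = Flask(__name__)

# The model is loaded once at startup and kept in memory to stay within the latency budget.
model = CatBoostClassifier()
model.load_model("sherlock.cbm")

def load_profiles(user_id, merchant_id):
    """Placeholder for the Couchbase lookups of user and merchant profiles."""
    raise NotImplementedError

def build_feature_vector(txn, user_profile, merchant_profile):
    """Placeholder: must return feature values in the same order as the training columns."""
    raise NotImplementedError

@app.route("/score", methods=["POST"])
def score():
    txn = request.get_json()
    user_profile, merchant_profile = load_profiles(txn["user_id"], txn["merchant_id"])
    features = build_feature_vector(txn, user_profile, merchant_profile)
    fraud_probability = float(model.predict_proba([features])[0][1])
    return jsonify({"fraud_probability": fraud_probability,
                    "decline": fraud_probability >= 0.5})  # illustrative threshold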

In the end

If the probability of fraud is below a certain threshold, the processing backend lets the transaction go through, and the user sees a “Payment Approved” message on a terminal or a website.

If the probability of fraud is above the threshold, the processing backend declines the transaction and sends a push notification to the app.

Performance monitoring

We use Google Cloud Stackdriver and Kibana dashboards to monitor the system performance in real-time. Our areas of interest here are:

  • Functional performance, such as merchant monitoring, the number of alerts and frauds, and the number of true and false positives.
  • Operational performance, such as latency (how fast the system responds), number of transactions processed per second, and more.

Stackdriver sends us email and SMS alerts in the rare cases when issues arise.

What’s next?

We're happy with our current results: the fraud rate is already low, but we're going to continue improving the prediction quality. Ultimately, we see Sherlock as an end product that other banks and financial institutions can purchase and integrate.

I often talk to people from banks, and some of them have already asked if we can sell our technology to them. Operations that banks process manually for hours take just a few minutes at Revolut. If you're going to sign up millions of users and scale your business to new markets worldwide, you cannot rely on manual work alone.

Nikolay Storonsky, CEO and co-founder of Revolut.

Join Revolut

Today, Revolut has 9 million customers, and that number continues to grow. If you’re interested in helping us take Sherlock to the next level, check out our vacancies on the Careers page.


P.S. Have you ever received a push notification from Sherlock? How did it make you feel? Let me know in the comments!

How to Build an Infinitely Scalable Video Captioning Service with Firebase and Kubernetes

When I was a kid, I loved playing Lego. It was magical seeing anything from castles to cars appear out of the same colourful building blocks.

Today, I love tinkering with software frameworks and Google Cloud products, building solutions to problems that I encounter in life. Just like with Lego – there is an infinite number of combinations. It only takes a little bit of imagination and continuous experimentation to put these building blocks to good use.

Last summer, I started recording short videos aiming to practice speaking on camera and build my personal brand. Yes, marketing is also one of those Lego blocks I discovered a couple of years ago.

I noticed that the best videos on social media have captions to catch the attention of people scrolling through their newsfeed with the sound off. It's also a matter of respect for people who can't hear.

I started looking for a solution that would allow me to record a video on the go, send it somewhere and get back a captioned version of it.

Either I wasn't patient enough with my googling, or those few web services, like Kapwing and Zubtitle, didn't have their SEO set up, but I couldn't find anything. Funnily enough, I started seeing their ads right after I finished working on Captionly!

I decided to build such a service myself out of the available "Lego blocks".

In this article, I'm going to show you how I built Captionly – a web service that allows people to do just that – generate a captioned version of their videos.


What If It Doesn't Work?

But first, have you ever had moments when you wanted to solve a problem but weren't sure how? You decide to try an approach, but you aren't sure whether it's going to work or whether it actually makes sense. Such doubts nag you as you're building your solution, but you carry on nonetheless.

Then, sometime later, you hear someone talking about a similar approach or architecture at a conference. You experience a feeling of relief, knowing that you were right all along! It boosts your confidence, and you become even braver about experimenting with your ideas.

Do you remember such moments in your life?

On 20-21st November 2019, Google held their annual event in London – Google Cloud Next. At one of the presentations, Bret McGowen showed how to build a serverless online shop – pretty much the same way I made my Captionly – with AppEngine and Cloud Functions. That's when I realised that what I developed made sense!


Building Captionly

Captionly Architecture on Google Cloud

Getting a text version of captions wasn't a problem. I knew about Rev.com, a service set up by guys from MIT many years ago. They built a network of professional captioners and, over the years, accumulated a high-quality dataset to train AI models outperforming Google and IBM! Last summer, they launched Rev.ai, offering AI-generated captions that are slightly lower in quality than human-made ones but cheaper and much faster.

Besides, Rev have a convenient API that lets you automate the process of ordering either human- or AI-made captions, transcripts, and translations.

To build a service that returns captioned videos, we require three elements:

  • a website to let users upload a video
  • an integration with Rev to get a text file with captions
  • a service that embeds the captions into the video

I decided to try Firebase – a Google service that comes with the Firestore database, Cloud Functions and several other services that help build serverless web and mobile apps.

Firebase also freed me from worrying about implementing secure user authentication, because it takes care of that very elegantly, supporting multiple social media logins.

User Authentication at Captionly through Firebase Authentication

To build the frontend, I used the React + Material-UI + Firebase boilerplate app that comes with ready-made integrations with Firebase Authentication. I combined the React frontend with a Flask backend running on the Google Cloud AppEngine Standard Environment.

Firebase Storage, which runs on Google Cloud Storage, provides a JavaScript SDK that I used to let Captionly users upload their videos directly to Cloud Storage through the web browser. Firebase Storage also comes with a way to define security rules, making sure that users can read and write only their own files.

When a user uploads her video, I create an entry in Firestore capturing the details of the order, such as the Storage path to the uploaded file.
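
For example, creating that order document from the backend with the firebase-admin SDK might look roughly like this; the collection name and fields are invented for the sketch:

import firebase_admin
from firebase_admin import firestore

firebase_admin.initialize_app()           # uses the default service-account credentials
db = firestore.client()

def create_order(user_id: str, storage_path: str) -> str:
    """Record a new captioning order after the video lands in Cloud Storage."""
    doc_ref = db.collection("orders").document()
    doc_ref.set({
        "user_id": user_id,
        "video_path": storage_path,       # e.g. the Storage path of the uploaded file
        "status": "Video Uploaded",
        "created_at": firestore.SERVER_TIMESTAMP,
    })
    return doc_ref.id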

Firestore allows writing Cloud Functions that get triggered automatically whenever a change happens in the database. We write such functions using JavaScript or TypeScript.

Once the user's order status changes to "Video Uploaded" in the database, a Firestore Function gets triggered to create a new order with Rev through their API. The order status gets changed to "Captions Order Submitted".
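
In production this lives in a JavaScript Cloud Function, but the essence is an authenticated POST to Rev's API. Here's a rough Python equivalent using requests; the endpoint and payload fields are assumptions to check against Rev's documentation:

import requests

REV_AI_TOKEN = "..."  # stored as a secret, never in code

def submit_captions_job(media_url: str, callback_url: str) -> str:
    """Ask Rev.ai to transcribe the uploaded video and call us back when done."""
    response = requests.post(
        "https://api.rev.ai/speechtotext/v1/jobs",          # assumed endpoint
        headers={"Authorization": f"Bearer {REV_AI_TOKEN}"},
        json={"media_url": media_url, "callback_url": callback_url},
        timeout=30,
    )
    response.raise_for_status()
    return response.json()["id"]                            # Rev's job id, stored on the order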

It takes a while for Rev to process the video and generate captions. Depending on the user's choice at Captionly, it takes from about an hour for high-quality human-made captions to a couple of minutes for AI-made captions.

When Rev completes the order, they trigger an endpoint that I created in Cloud Functions. The function downloads the text file with captions to the corresponding order folder in Cloud Storage. The order status gets changed to "Captions Created", followed by "Rendering Started".
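
Sketched in Python, that callback boils down to the following; the bucket name, the payload structure, and the captions-fetching helper are assumptions for illustration:

from google.cloud import firestore, storage

storage_client = storage.Client()
db = firestore.Client()

def fetch_captions_from_rev(job_id: str) -> str:
    """Placeholder: download the finished SRT captions through Rev's API."""
    raise NotImplementedError

def rev_callback(request):
    """HTTP Cloud Function hit by Rev when a captioning job is done."""
    payload = request.get_json()
    order_id = payload["job"]["metadata"]       # assumed: our order id travels as job metadata
    captions = fetch_captions_from_rev(payload["job"]["id"])

    # Store the captions next to the order's video in Cloud Storage.
    bucket = storage_client.bucket("captionly-orders")       # illustrative bucket name
    bucket.blob(f"{order_id}/captions.srt").upload_from_string(captions)

    # Move the order forward; it later changes to "Rendering Started", which triggers rendering.
    db.collection("orders").document(order_id).update({"status": "Captions Created"})
    return ("", 204)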

This status change triggers another Firebase function, which sends the order details to my video rendering service.

Video Rendering with FFmpeg

Video rendering is an interesting problem. There are several video editing solutions ranging from paid ones like Adobe Premiere and Apple Final Cut Pro X to free ones. However, I didn't need a user interface to embed captions into videos. I wanted a command-line version to automate the process entirely.

That's how I discovered FFmpeg – an open-source console-only application that allows you to do anything you can imagine with videos as long as you are patient figuring out how to encode what you want to do using the command-line options.

To give you an idea, here's how to ask FFmpeg to embed captions into a video:


ffmpeg -y -f lavfi -i color=color=#BF0210:size=3840x40 -t 38 -pix_fmt yuv420p dark_red_2000_27.mov && ffmpeg -y -i creative_block.MOV -i dark_red_2000_27.mov -filter_complex "[0:v]pad=w=iw:h=3840:x=0:y=840:color=white[padded];[padded][1:v]overlay=x='-w+(t*w/38)':y=3000[padded_progress];[padded_progress]drawtext=fontfile=/fonts/roboto/Roboto-Bold.ttf: text='OVERCOMING CREATIVE BLOCK': fontcolor=#BF0210: fontsize=200: x=(w-text_w)/2: y=(840-text_h)/2[titled];[titled]subtitles=creative_block.srt: force_style='Fontname=Roboto Bold,PrimaryColour=&H1002BF&,Outline=0,Fontsize=16,MarginV=0020'" -codec:a copy creative_block_padded.mov

I created a service that takes a video file and a corresponding text file with captions and merges them, delivering a captioned version of the video.

Video rendering is a memory- and CPU-intensive process, so I need powerful enough virtual machines to accomplish the task.

Besides, I wanted my video rendering service to be scalable and automatically spin up necessary computing resources depending on the workload – the number of orders submitted through Captionly.

I decided to leverage the power of Google Kubernetes Engine and its capability to scale both horizontally and vertically.

I didn't have any experience with Kubernetes when I started this project, so it was a steep learning curve for me understanding the relationships between nodes, pods, containers, deployments, and services.

I created my Kubernetes cluster with a node pool, specifying that I want it to be horizontally and vertically scalable. In the minimal configuration, when there is no workload, my cluster runs a single small preemptible virtual machine. When video orders start flowing in, Kubernetes provisions additional pods of my rendering service. When the number of pods becomes too large, Kubernetes spins up additional nodes to allocate new pods there. If an order comes in with a lengthy video that requires more computing power and memory, Kubernetes spins up a more powerful VM according to the limits I predefined.

Such a setup is incredibly cost-effective and can scale pretty much infinitely.

To orchestrate the video rendering jobs, I set up Celery using Google Cloud Memorystore, a managed Redis service, as a synchronisation backend.

After the order status in Firestore changes to "Rendering Started", the Cloud Function sends the order details to my endpoint in AppEngine. The AppEngine function then enqueues a Celery task.

Celery triggers the job in Kubernetes that pulls the video and the captions file from Cloud Storage and launches FFmpeg to render the video. The completed video gets uploaded to Cloud Storage, and the rendering service calls a Cloud Function, which updates the order status to "Rendering Completed" and sends the user a notification email.
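
Here's a compressed sketch of such a worker task; the Redis address, bucket name, and the FFmpeg invocation (reduced to simply burning in subtitles) are illustrative:

import subprocess
from celery import Celery
from google.cloud import storage

app = Celery("captionly", broker="redis://10.0.0.3:6379/0")   # Memorystore address (illustrative)
storage_client = storage.Client()

@app.task
def render_order(order_id: str) -> None:
    bucket = storage_client.bucket("captionly-orders")        # illustrative bucket name
    bucket.blob(f"{order_id}/video.mov").download_to_filename("/tmp/video.mov")
    bucket.blob(f"{order_id}/captions.srt").download_to_filename("/tmp/captions.srt")

    # Burn the subtitles into the video (a much simpler filter than the full Captionly layout).
    subprocess.run(
        ["ffmpeg", "-y", "-i", "/tmp/video.mov", "-vf", "subtitles=/tmp/captions.srt",
         "-codec:a", "copy", "/tmp/captioned.mov"],
        check=True,
    )

    bucket.blob(f"{order_id}/captioned.mov").upload_from_filename("/tmp/captioned.mov")
    # A Cloud Function (not shown) then flips the order to "Rendering Completed" and emails the user.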

The user can watch the order status change in real time in her account on the website, without refreshing the page. Firestore can notify subscribers, our website in this case, about any changes that happen in the database.

Accepting Payments with Stripe

To accept payments for the Captionly orders, I built an integration with Stripe using their powerful and very flexible Python API and ReactJS elements for the payment form.

I wanted the payment form to look very natural on the website and also support subscriptions, as well as Apple Pay and Google Pay.

Payment Form at Captionly using Stripe ReactJS Elements

This required setting up an additional endpoint to listen for events sent by Stripe when payments are processed.

Such a setup allowed me to stay PCI compliant and satisfy SCA requirements: I don't store or process user payment details myself at all, relying on Stripe instead.
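
The event-listening endpoint boils down to verifying Stripe's signature and reacting to the event type. Here's a trimmed Flask sketch, with the webhook secret read from the environment and the order-fulfilment step left as a placeholder:

import os
import stripe
from flask import Flask, request, abort

app = Flask(__name__)
endpoint_secret = os.environ["STRIPE_WEBHOOK_SECRET"]

def mark_order_paid(order_id: str) -> None:
    """Placeholder: update the order status in Firestore."""

@app.route("/stripe/webhook", methods=["POST"])
def stripe_webhook():
    try:
        # Verifying the signature ensures the event really came from Stripe.
        event = stripe.Webhook.construct_event(
            request.get_data(), request.headers.get("Stripe-Signature"), endpoint_secret
        )
    except (ValueError, stripe.error.SignatureVerificationError):
        abort(400)

    if event["type"] == "payment_intent.succeeded":
        order_id = event["data"]["object"]["metadata"].get("order_id")  # assumed: set when creating the payment
        mark_order_paid(order_id)

    return "", 200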


It's Your Turn!

This experience of building a fully serverless infrastructure paired with a scalable Kubernetes service made me even more convinced that we have incredible power at our disposal to build anything we can imagine.

The trick is to be able to find problems – that's the hardest bit!

I encourage you to experiment with cloud services yourself, because that's how you come to realise what's possible to build. It also helps you keep your technical skills sharp and lets you quickly prototype your ideas.

In the end, is there anything more exciting than seeing your ideas come to life?

If you've read this far, I invite you to give Captionly a try using this 25% discount link valid for any subscriptions that we offer.

If you wonder what to talk about, here's a short video on how to get ideas for your videos! In addition, check out my Instagram @dimileeh to watch videos I created using Captionly. Good luck!