Hexadecimal representation of negative integers #235

polyvertex · 2015-11-26T14:06:00Z

When using format string {:#08x} to display the 32bit integer value -2147023083 in hexadecimal, I get -0x7ff8f8eb. While I understand this result is mathematically correct, I was expecting to read 0x80070715 instead since I usually use hexadecimal output to get a more intuitive representation of a variable in memory. Is there an option (compile-time or runtime) to change this default behavior without having to explicitly cast my arguments?

In case it matters, I'm using release 1.1.0.

The text was updated successfully, but these errors were encountered:

vitaut · 2015-11-26T17:06:10Z

No, there is no such option, but there are some alternatives:

Use fmt::printf/fmt::sprintf which don't print the sign

fmt::printf("%#08x", -2147023083); // prints 0x80070715

Add a helper function to do the cast (a bit easier than casting every time):

unsigned UInt(int value) {
  return static_cast<unsigned>(value);
}
fmt::print("{:#08x}", UInt(-2147023083));

vitaut · 2015-12-01T16:50:43Z

@polyvertex Do any of the proposed alternatives work for you?

polyvertex · 2015-12-01T17:25:02Z

The fmt::sprintf alternative doesn't blend well with the existing code and casting is something I would like to avoid while I admittedly was already doing that before posting here. I considered modifying cppformat to offer this feature as an option and eventually make a pull request if that would be of interest to anyone, but it appears this modification may be quite invasive.

Out of curiosity, how did you make the choice to behave differently than std::printf for that case? I essentially used wrappers over std::sprintf and friends to format strings before intensively migrating to your library so I'm not aware of how other formatting libraries behave. Hence my surprise when I stumbled upon this result.

vitaut · 2015-12-02T15:31:05Z

Out of curiosity, how did you make the choice to behave differently than std::printf for that case?

fmt::format and fmt::print are based on Python's str.format and mostly follow its conventions:

>>> '{:x}'.format(-42)
'-2a'

while fmt::sprintf and fmt::printf follow std::printf's.

It's hard to say which one is the best. I think it's reasonable to format signed integers with a sign regardless of the base, but from your comments I see that the other option might be useful too.

One possibility is to pass a custom argument formatter as a template argument to BasicFormatter. Then you could easily implement your custom formatting functions that do the necessary conversions to unsigned, something like:

class CustomArgFormatter : public fmt::BasicArgFormatter<CustomArgFormatter, char>  {
 public:
  typedef fmt::BasicArgFormatter<CustomArgFormatter, char> Base;
  CustomArgFormatter(fmt::BasicFormatter<char, CustomArgFormatter> &f, fmt::FormatSpec &s, const char *)
  : fmt::BasicArgFormatter<CustomArgFormatter, char>(f.writer(), s) {}

  void visit_int(int value) {
    fmt::FormatSpec &spec = this->spec();
    if (spec.type() == 'x')
      visit_uint(value); // convert to unsigned and format
    else
      Base::visit_int(value);
  }
};

std::string format(const char *format_str, fmt::ArgList args) {
  fmt::MemoryWriter writer;
  fmt::BasicFormatter<char, CustomArgFormatter> formatter(args, writer);
  formatter.format(format_str);
  return writer.str();
}
FMT_VARIADIC(std::string, format, const char *)

This will require some changes to the library. I did a quick and dirty prototype to see that this is feasible and if you interested I can clean it up and push the changes.

polyvertex · 2015-12-03T14:20:02Z

IMO the custom argument formatter thing would be great as it would save the library from the creation of an exotic option. This would perfectly fit my needs actually since I would also like to interpret the HASH_FLAG differently than the default behavior.

Would that be feasible to use this custom formatter along/integrated with a custom Writer as described in #140? This is the way I use cppformat mainly.

PS: I'm still amazed by the amount of time you spend maintaining and improving your library as well as offering close support. As a user, I just hope it won't become a bloat of features!

vitaut · 2015-12-04T16:56:29Z

IMO the custom argument formatter thing would be great as it would save the library from the creation of an exotic option. This would perfectly fit my needs actually since I would also like to interpret the HASH_FLAG differently than the default behavior.

Cool, let's go with this option then. I've done some preliminary support on custom formatters, but still need to make argument formatter customizable.

Would that be feasible to use this custom formatter along/integrated with a custom Writer as described in #140?

BasicFormatter can use any writer derived from BasicWriter, so I don't see any problem with this.

Thanks for the nice feedback =). I do try to keep the library lean and avoid feature creep. Therefore providing a facility to implement some feature is preferred to adding this feature directly (unless it is something that is likely to be widely used).

vitaut · 2016-03-19T14:52:05Z

Mostly done in 52f8906. Just need to move the argument formatter class from fmt::internal to fmt namespace and document the new functionality.

vitaut · 2016-03-19T15:36:46Z

Here's a working example:

#include "format.h"

class TrumpFormatter : public fmt::internal::ArgFormatterBase<TrumpFormatter, char>  {
 public:
  typedef fmt::internal::ArgFormatterBase<TrumpFormatter, char> Base;

  TrumpFormatter(fmt::BasicFormatter<char, TrumpFormatter> &f, fmt::FormatSpec &s, const char *)
  : fmt::internal::ArgFormatterBase<TrumpFormatter, char>(f.writer(), s) {}

  void visit_cstring(const char *str) {
    if (std::strcmp(str, "Trump") == 0)
      str = "Someone With Tiny Hands";
    Base::visit_cstring(str);
  }
};

void trump_print(const char *format_str, fmt::ArgList args) {
  fmt::MemoryWriter writer;
  fmt::BasicFormatter<char, TrumpFormatter> formatter(args, writer);
  formatter.format(format_str);
  std::puts(writer.c_str());
}
FMT_VARIADIC(void, trump_print, const char *)

int main() {
  trump_print("{} has his \"Make America Great Again\" hats.", "Trump");
}

It prints:

Someone With Tiny Hands has his "Make America Great Again" hats.

polyvertex · 2016-03-22T13:15:19Z

It feels like Christmas :)

MrSapps · 2016-03-23T15:30:17Z

Edit: Solved earlier issue.

Can this work with {} style strings? The base class seems to be looking for printf style format strings?

vitaut · 2016-03-23T18:17:59Z

This works with Python-like {} format strings (please see the example above). ArgFormatterBase handles common format specifiers while PrintfArgFormatter implements printf-specific adjustments. Since the two formats have a lot in common PrintfArgFormatter has very little work to do.

MrSapps · 2016-03-24T09:25:24Z

Not sure what I was doing yesterday, it clearly works with {} strings as you say! I'm trying to do this:

void visit_int(int value)
    {
        if (spec().type() == 'X')
        {
            // Format all signed hex values as signed
            visit_uint(value);
        }
        else
        {
            Base::visit_int(value);
        }
    }

    void visit_uint(unsigned int value)
    {
        if (spec().type() == 'X')
        {
            // Always add "0x" if the user didn't specify it
            if (!spec().flag(fmt::HASH_FLAG))
            {
                mWriter << "0x";
            }

            // Pad with 0's up to the "size" of the type
            spec().width_ = sizeof(value) * 2;
            spec().fill_ = '0';
        }

        Base::visit_uint(value);
    }

This can work for int/unsigned int and __int64/unsigned _int64. However for short int and unsigned short int there is no visit_shortint function, is there a way to get the size of the type in bytes? I'd like this so I can set the spec().width = sizeof(value) * 2; correctly so that short int t = 0xde appears as 0x00DE instead of 0x000000DE.

vitaut · 2016-03-24T13:22:06Z

short is promoted to int as in printf, so you can't check if the original value was short unfortunately.

This allows providing custom argument formatters without relying on internal APIs (#235).

vitaut · 2016-04-20T14:24:47Z

BasicArgFormatter is now public and the example in #235 (comment) works, so there is no need to rely on internal APIs any more.

Here's the first stab at the docs: http://cppformat.github.io/dev/api.html#argument-formatters.

vitaut · 2016-04-21T14:38:57Z

I think there should be enough information in http://cppformat.github.io/dev/api.html#argument-formatters now to implement custom argument formatters and there is an example that covers specifically hexadecimal representation of negative integers. Therefore I'm closing this.

polyvertex · 2016-04-22T07:43:26Z

This is great, really. I haven't had the chance to try it yet (neither I had the opportunity to ever implement the curiously recurring template pattern! which probably means I should think my designs twice) but according to your docs, all the ingredients I was missing and hopping for are there. You've made a user happy!

vitaut · 2016-04-23T02:48:26Z

Thanks for the nice feedback. =)

polyvertex · 2016-11-01T11:16:03Z

I finally had a chance to give it a try. It's exactly what I wished to achieve, thanks

Leandros · 2017-10-16T10:09:44Z

This is the wrong behaviour. Hexadecimal should never print negative. How can I reset the behaviour (without always casting to an unsigned type)?

vitaut · 2017-10-21T17:22:51Z

Hexadecimal should never print negative.

@Leandros, why? It seems reasonable to expect signed integer to be formatted with a sign regardless of the base. Could you elaborate a bit on your use case.

How can I reset the behaviour (without always casting to an unsigned type)?

You can implement your own formatting function that formats signed integers without sign in hexadecimal as explained above.

polyvertex · 2017-10-21T18:58:09Z

To help @Leandros a bit, here is an arg formatter I use:

// An fmt::BasicArgFormatter that forcefully casts integers to unsigned when
// the 'x' or 'X' type specifier is used.
template <typename Char, typename Spec = format_spec>
class arg_formatter : public ::fmt::BasicArgFormatter<arg_formatter<Char>, Char, Spec>
{
public:
    typedef ::fmt::BasicArgFormatter<arg_formatter<Char>, Char, Spec> SuperT;

    arg_formatter<Char, Spec>(
            ::fmt::BasicFormatter<Char, arg_formatter>& f,
            typename Spec& s, const Char* fmt)
        : SuperT(f, s, fmt) { }

    template <typename T>
    void visit_any_int(T value)
    {
        const char type_lc = this->spec().type() | 0x20; // lower case
        if (!std::is_unsigned<T>::value && (type_lc == 'x')) // || type_lc == 'b'))
        {
            typedef std::make_unsigned<T>::type unsigned_type;
            SuperT::visit_any_int<unsigned_type>(value);
        }
        else
        {
            SuperT::visit_any_int<T>(value);
        }
    }
};

Leandros · 2017-10-21T19:53:27Z

@vitaut Might just be me, but I connect hexadecimal output with printing the memory behind the variable. It, hence, makes zero sense to print the sign. And it also might come from me being used to use printf, where %x is doing exactly what I described:

#include <stdio.h>
int main(void) 
{
	int x = -255;
	printf("%x\n", x);
	return 0;
}

will output:

ffffff01

https://ideone.com/62jFyu

vitaut · 2017-10-22T17:34:23Z

I don't have a very strong opinion on what the default behavior should be and will accept a PR to change it. However, since changing the default formatting may break clients' expectations, I suggest doing it in the std branch that will become the basis of the next major release.

polyvertex changed the title ~~Hexadecimal for negative integers~~ Hexadecimal representation of negative integers Nov 26, 2015

vitaut added a commit that referenced this issue Apr 19, 2016

Make BasicArgFormatter public and add ArgFormatter

b69e6dc

This allows providing custom argument formatters without relying on internal APIs (#235).

vitaut closed this as completed Apr 21, 2016

vitaut mentioned this issue May 26, 2016

Request: Custom argument formatting for PrintfFormatter #335

Closed

ringabout mentioned this issue Jul 4, 2021

echo -0x80'i8 prints 128 (which is out of range) and other bugs nim-lang/Nim#18422

Open

adepke mentioned this issue Nov 25, 2021

HRESULT log formatting adepke/VanguardEngine#54

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hexadecimal representation of negative integers #235

Hexadecimal representation of negative integers #235

polyvertex commented Nov 26, 2015

vitaut commented Nov 26, 2015

vitaut commented Dec 1, 2015

polyvertex commented Dec 1, 2015

vitaut commented Dec 2, 2015

polyvertex commented Dec 3, 2015

vitaut commented Dec 4, 2015

vitaut commented Mar 19, 2016

vitaut commented Mar 19, 2016

polyvertex commented Mar 22, 2016

MrSapps commented Mar 23, 2016

vitaut commented Mar 23, 2016

MrSapps commented Mar 24, 2016

vitaut commented Mar 24, 2016

vitaut commented Apr 20, 2016

vitaut commented Apr 21, 2016

polyvertex commented Apr 22, 2016

vitaut commented Apr 23, 2016

polyvertex commented Nov 1, 2016

Leandros commented Oct 16, 2017 •

edited

Loading

vitaut commented Oct 21, 2017

polyvertex commented Oct 21, 2017

Leandros commented Oct 21, 2017 •

edited

Loading

vitaut commented Oct 22, 2017

Hexadecimal representation of negative integers #235

Hexadecimal representation of negative integers #235

Comments

polyvertex commented Nov 26, 2015

vitaut commented Nov 26, 2015

vitaut commented Dec 1, 2015

polyvertex commented Dec 1, 2015

vitaut commented Dec 2, 2015

polyvertex commented Dec 3, 2015

vitaut commented Dec 4, 2015

vitaut commented Mar 19, 2016

vitaut commented Mar 19, 2016

polyvertex commented Mar 22, 2016

MrSapps commented Mar 23, 2016

vitaut commented Mar 23, 2016

MrSapps commented Mar 24, 2016

vitaut commented Mar 24, 2016

vitaut commented Apr 20, 2016

vitaut commented Apr 21, 2016

polyvertex commented Apr 22, 2016

vitaut commented Apr 23, 2016

polyvertex commented Nov 1, 2016

Leandros commented Oct 16, 2017 • edited Loading

vitaut commented Oct 21, 2017

polyvertex commented Oct 21, 2017

Leandros commented Oct 21, 2017 • edited Loading

vitaut commented Oct 22, 2017

Leandros commented Oct 16, 2017 •

edited

Loading

Leandros commented Oct 21, 2017 •

edited

Loading